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PRODUCTION OF RECOMBINANT POLYPEPTIDES 



BY_ BOVINE SPECIES ANDTKSNSGENIC METHODS^ 



a continuation-in-part of U.S. patent 



^apg l^a^ioin^^ 



Serial No. 08/077,788, filed June 15, 1993 /y | which is a 
continuation-in-part of U.S. patent application Serial 



No. 07/898,956, filed June 15, 1992 r/ ( which is y a 
5 continuation-in-part of U.S. patent application Serial 



continuation-in-part of U.S. patent application Serial 
No. 07/444,745 filed December 1, 1989 (now abandoned) . 
Each of the above applications is incorporated by 
10 reference in its entirety for all purposes. 

Field of the Invention 

The invention relates to the production of recombinant 
polypeptides by transgenic bovine species and to methods 
for producing transgenic non-human mammals having a 
15 desired phenotype. 

Background of the Invention 

There is a plethora of literature relating to the 
expression of heterologous genes in lower organisms such 
as unicellular bacteria, yeast and filamentous fungi, 

0 and in higher cell types such as mammalian cells. There 
are also numerous reports on the production of 
transgenic animals, most of which relate to the 
production of transgenic mice. See, e.g., U.S. Pat. No. 
4,736,866 (transgenic mice containing activated 

5 oncogene); Andres, A., et al. (1987) Proc. Natl. Acad. 
Sci J _JJSA 84 : 1299-1303 (HA-RAS oncogene under control of 
whey acid protein promoter) ; Schoenberger , C.A. , et al. 
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(1987) Experientia 43 :644 and (1988) EMBO J. 7 : 169-175 
(C-myc oncogene under control of whey acid protein 
promoter); and Muller, W.J., et al. (1988) Cell 54 : 105- 
115 (C-myc oncogene under control of the mouse mammary 
5 tumor virus promoter) . Several laboratories have also 
reported the production of transgenic porcine species 
(Miller, K.F., et al. (1989) J . Endocrin. 120 :481-488 
(expression of human or bovine growth hormone gene in 
transgenic swine); Vize, P.D., et al. (1988) J. Cell 
10 Sci . 9_0: 295-3 00 (porcine growth hormone fusion gene in 
transgenic pigs); and Ebert, K. et al. (1988) Mol. 
Endocrin. 2:277-283 (MMLV-rat somatotropin fusion gene 
in transgenic pigs) ) , transgenic sheep (Nancarrow, et 
al. (1987) Theriogenoloav .27:263 (transgenic sheep 
15 containing bovine growth hormone gene) Clark, A.J. et 
al. (1989) Bio/Technoloav 7:487-482 and Simons, J., et 
al. (1988) Bio/Technology fi= 179-183 (human factor IX and 
Qt-1 antitrypsin CONA in ovine species) , and rabbit 
(Hanover, S.V., et al. (1987) Deutche Tierarztliche 
20 Wochens chrift 94,: 476-478 (production of transgenic 
rabbits by injection of uteroglobin-promoter-CAT fusion 
gene into fertilized rabbit oocytes) . A number of 
reports have also suggested the production of transgenic 
cattle (Wagner, et al. (1984) Therioaenolocrv 21 ; 29-441 
25 with one reporting some progress in microinjection 
techniques (Lohse, J.K. , et al. (1985) Therioaenolocrv 
23.:205). However, little, if any, success has been 
achieved in producing transgenic cows. Scientific 
articles which clearly demonstrate the actual production 
3 0 of a transgenic cow capable of producing a heterologous 
protein are presently unknown. This, despite the 
statements that one transgenic cow was produced in 
Canada which expressed human 0- interferon (Van Brunt, 
J- (1988) Bio/Technology fi; i uq-i i ^) and that transient 
35 'expression of human a-f etoprotein in liver and blood was 
obtained on one occasion (Church, R.B. (1986) 
Biotech nology News Watch 6 (15), 4). One reference 
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reports that bovine papilloma virus was apparently 
integrated but not expressed in a transgenic cow 
(Roschlau, et al. (1988) Arch . Tierz . , Berlin 31 : 3-8) , 
A recent article has summarized the genetic engineering 
5 of livestock. (Pursel, V.G. et al. (1989.) Science 
244 : 1281-1288) . 

A number of laboratories have reported tissue-specific 
expression of DNA encoding various proteins in the 
mammary gland or the production of various proteins in 

10 the milk of transgenic mice and sheep. For example , 
Simmons, J. P., et al. (1987) Nature 328:530-532 report 
the microinjection of a 16.2 kb genomic fragment 
encoding /3-lactoglobulin (BLG) including 4 kb of 5 1 
sequence, 4 . 9 kb of the BLG transcription unit and 

15 7.3 kb of 3' flanking sequence into fertilized mouse 
eggs. According to these authors, the sheep BLG was 
expressed in mammary tissue and produced BLG in the milk 
of the transgenic mice at concentrations ranging from 
about 3.0 to about 23 mg/ml. When, however, cDNA 

2 0 encoding human factor IX or human al-antitrypsin was 

inserted into the 5 ' untranslated region, of the BLG gene 
and microinjected into sheep (Simmons, J. P., et al. 
(1988) Bio /Technology 6:179-183) the production of 
factor IX or ocl-antitrypsin was significantly reduced 
25 (25ng/ml for factor IX and lOmg/ml for al-antitrypsin; 
see Clark, A.J. , et al. (1989) Bio / Techno locry 7:487- 
492) . 

In a similar approach, a 14 kb genomic clone containing 
the entire 7.5 kb rat 0-casein together with 3.5 kb of 

3 0 5 1 and 3.0 kb of 3* flanking DNA was reportedly 

microinjected into fertilized mouse oocytes. Lee, et 
al. (1988) Nucl. Acids Res. 16:1027-1041. Yet, in this 
case, the level of expression of the rat jS-transgene in 
the lactating mammary gland of transgenic mice was 
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reported to be at a level of 0.01-1% of the endogenous 
mouse /3-casein gene. 

Human tissue plasminogen activator (t-PA) reportedly was 
produced in transgenic mouse milk at the levels between 
0,2 and about 0.4/ig/ntl when a cDNA encoding a human t-PA 
with its endogenous secretion sequence was expressed 
under control of a 2.6* kb 5' sequence of the murine whey 
acid protein gene. Gordon, K. , et al. (1987) 
Bio /Technology 5.: 1183-1187 . Subsequent experiments 
using the same or similar construction reportedly 
produced t-PA in different mouse lines arranging from 
less than 2 0ng of t-PA per ml of milk to about 50/xg/ml. 
Pittius, C.W. , et al. (1988) Proc. Natl. Acad. Sci. USA 
85 : 5874-5878 . 

U.S. Patent No. 4,873,316 issued October 10, 1989, 
discloses the use of 9 kb of 5 1 sequence from the bovine 
aSl casein gene including the casein signal peptide and 
several casein codons fused to a mature t-PA sequence. 
The transgenic mice obtained with this construct 
reportedly produced about 0 . 2-0 . 5/ig/ml of a t-PA fusion 
protein in their milk. 

In addition, a number of patent publications purportedly 
describe the production of specific proteins in the milk 
of transgenic mice and sheep. See, e.g. European Patent 
Publication No. 0 264 166 published April 20, 1988 
(hepatitis B surface antigen and t-PA genes under 
control of the whey acid promoter protein for mammary 
tissue specific expression in mice) ; PCT Publication No. 
W088/00239 published January 14, 1988 (tissue specific 
expression of a transgene encoding factor IX under 
control of a whey protein promoter in sheep) ; PCT 
Publication No. W088/ 01648 published March 10, 1988 
(transgenic mouse having mammary secretory cells 
incorporating a recombinant expression system comprising 
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a bovine a-lactalbumin gene fused to interleukin-2 ) ; 
European Pat. Pub. No. 0 279 582 published August 24, 
1988 (tissue-specific expression of chloramphenicol 
acetyltransf erase under control of rat /3-casein promoter 
5 in transgenic mice) ; and PCT Pub. No. W088/ 10118 
published December 29, 1988 (transgenic mice and sheep 
containing transgene encoding bovine aSl casein promoter 
and signal sequence fused to t-PA) . 

Given the state of the transgenic art, it is apparent 
10 that a need exists for methods which enable the 
efficient production of transgenic mammals, especially 
transgenic mammals other than transgenic mice. 

Further, it is apparent that a need exists for methods 
for producing transgenic bovine species which are 
15 capable of producing recombinant polypeptides such as 
human milk proteins and human serum proteins in the milk 
of such transgenic mammals. 

Accordingly, it is an object herein to provide methods 
for detecting the transgenesis of fertilized oocytes 

2 0 prior to implantation. 

In addition, it is an object herein to provide 
transgenic bovine species which are capable of producing 
recombinant polypeptides which are maintained 
intracellularly or are secreted extracellular ly. 

25 It is also an object herein to provide transgenic bovine 
species which are capable of producing recombinant 
polypeptides such as human milk proteins and human serum 
proteins in the milk of such transgenic animals. 

Further, it is an object herein to provide milk from a 

3 0 transgenic bovine species containing such recombinant 

polypeptides. 
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Still further, it is an object herein to provide food 
formulations supplemented with recombinant polypeptides 
from such transgenic milk such as human infant formula 
supplemented with human lactoferrin. 

5 Further, it is an object herein to provide transgenes 
which are capable of directing the production of 
recombinant polypeptides in the milk of transgenic 
bovine species. 

The references discussed herein are provided solely for 
10 their disclosure prior to the filing date of the present 
application. Nothing herein is to be construed as an 
admission that the inventors are not entitled to 
antedate such disclosure by priority based on earlier 
filed applications, 

15 Summary of the Invention 

In accordance with the above objects, the invention 
includes transgenes for producing recombinant 
polypeptides in the milk of transgenic bovine species. 
The production of such transgenic bovine milk containing 

2 0 one or more recombinant polypeptides is desirable since 

it provides a matrix wherein little or no purification 
is necessary for human consumption. The transgene 
comprises a secretory DNA sequence encoding a secretory 
signal sequence which is functional in mammary secretory 
25 cells of the bovine species of interest and a 
recombinant DNA sequence encoding the recombinant 
polypeptide. These sequences are operably linked to 
form a secretory-recombinant DNA sequence. At least one 
expression regulation sequence, functional in the 

3 0 mammary secretory cells of the bovine species, is 

operably linked to the secretory-recombinant DNA 
sequence. The transgene so constructed is capable of 
directing the expression of the secretory-recombinant 
DNA sequence in mammary secretory cells of bovine 
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species containing the transgene. 



Such expression 



produces a form of recombinant polypeptide which is 
secreted from the mammary secretory cells into the milk 
of the transgenic bovine species. 

In addition, the invention includes methods for 
producing such transgenic bovine species. The method 
includes introducing the above transgene into an 
embryonal target cell of a bovine species, transplanting 
the transgenic embryonic target cell formed thereby into 
a recipient bovine parent and identifying at least one 
female offspring which is capable of producing the 
recombinant polypeptide in its milk. 

The invention also includes transgenic bovine species 
capable of producing recombinant polypeptides in the 
milk of lactating females of said species, the milk from 
such transgenic bovine species containing such 
recombinant polypeptides and food formulations 
containing the transgenic milk in liquid or dried form, 
as well as food formulations supplemented with one or 
more recombinant polypeptides from such transgenic milk* 

In addition to the foregoing, the invention includes 
transgenes and transgenic bovine species containing 
transgenes that are capable of producing a recombinant 
polypeptide. Such transgenes are similar to the 
aforementioned transgenes for milk secretion and are 
characterized by having an expression regulation 
sequence which targets the expression of the DNA 
encoding the recombinant polypeptide to a particular 
cell or tissue type, e.g. expression of human serum 
albumin in the liver of a transgenic bovine species. 
When the recombinant polypeptide is to be secreted from 
such targeted cells or tissues, a secretory DNA sequence 
encoding a secretory signal sequence functional in the 
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particular targeted cell or tissue is operably linked 
to the recombinant DNA sequence encoding the recombinant 
polypeptide, e.g. secretion of human serum albumin from 
bovine liver into the bovine circulatory system. 

5 Further, the invention includes methods for producing 
transgenic non-human mammals having a desirable 
phenotype. The method comprises first causing the 
methylation of a transgene capable of conferring the 
desirable phenotype when incorporated into the cells of 

10 a transgenic non-human animal, e.g., by transforming an 
appropriate bacterium, such as E. coli MM 294, with a 
plasmid containing the transgene. The methylated 
transgene is then excised and introduced into fertilized 
oocytes of the non-human animal to permit integration 

15 into the genome. The oocytes are then cultured to form 
pre-implantation embryos thereby replicating the genome 
of each of the fertilized oocytes. Thereafter, at least 
one cell is removed from each of the pre-implantation 
embryos and treated to release the DNA contained 

2 0 therein. Each of the released DNAs are then digested 

with a restriction endonuclease capable of cleaving the 
methylated transgene but incapable of cleaving the 
unmethylated form of the transgene formed after 
integration into and replication of the genomic DNA. 
25 Those pre-implantation embryos which have integrated the 
transgene contain DNA which is resistant to cleavage by 
the restriction endonuclease in the region containing 
the transgene. This resistance to digestion , which can 
be detected by electrophoresis of the digest after PGR 

3 0 amplification of the DNA and hybridization with a 

labelled probe for the transgene, facilitates the 
identification of successful transgenesis . 

The invention also includes a method to produce a 
population of transgenic offspring having the same 
35 genotype.* This method utilizes a specific embodiment 




of the above method for detecting early transgenesis . 
In this method , a methylated transgene is introduced 
into fertilized oocytes which are cultured to pre- 
implantation embryos. Thereafter, each pre-implantation 
embryo is divided to form first and second hemi-embryos . 
Each of the first hemi-embryos are then analyzed for 
transgenesis as described above. After identifying 
successful transgenesis in at least one first 
hemi-embryo, the second untreated hemi-embryo which 
contains the integrated transgene, is cloned to form a 
multiplicity of clonal transgenic blastocysts or hemi- 
blastocysts, each of which have the same genotype. The 
transgenic embryos are thereafter transplanted into one 
or more recipient female parents to produce a population 
of transgenic non-human mammals having the same 
genotype . 

Brief Description of the Drawings 

The accompanying drawings , which are incorporated in and 
form a part of the specification, illustrate embodiments 
of the present invention and, together with the 
description, serve to explain the principles of the 
invention. In the drawings: 

Fig. 1^ depicts the DNA (Seq. ID No. : 1) and amino acid 
(Seq. ID No.: 2) sequence for a human lactoferrin clone 
derived from a human mammary cDNA library as described 
herein except that the sequence between nucleotides 
1557-1791 and 2050-2119 corresponds to the previously 
published sequence (Rado et al. (1987) Blood 70:989- 
993). 

Fig. 2 depicts the complete DNA (Seq. ID No. : 3) and 
amino acid (Seq. ID No. : 4) sequence of human 
lactoferrin including 5' and 3 1 untranslated sequence 
as well as the complete human lactoferrin signal 
sequence. 
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Fig. 3 is a restriction map of a clone of a 5 '-flanking 
region of bovine otSl casein gene. 

Fig. 4 is a restriction map of a clone of a 3' -flanking 
region of bovine aSl casein gene. 

5 Figs. 5A, 5B and 5C depict the construction of 
pSIS'S'CAT and pSI5'CAT. 

Fig. 6 depicts pMH-1. 

Figs. 7A through 7F depict the construction of 
expression vectors containing sequences encoding human 
10 lactoferrin. 

Fig. 8 depicts the genome of human serum albumin, the 
fragments used to generate transgenic mice contained in 
this genomic DNA and the identification of the fragment 
sizes which would be obtained upon the digestion of 
15 genomic DNA from a transgenic mouse with the restriction 
enzymes BstE-II and Nco-I or with Nco-I and Hindi-Ill. 

Fig. 9 depicts an alternate pathway for the construction 
of a transgene of the invention encoding human 
lactoferrin. 

20 Fig. 10 depicts the construction of a plasmid pPC 
containing a transgene encoding Protein C. 



intervening sequence used in a preferred embodiment of 



25 (shown in lower case) consists of the 5 1 -end of IVS-1 
from bovine aSl casein (from position +54 to +180 with 
respect to the start of transcription) fused to the 3 1 - 
end of a human IgG splice sequence. The Hind III site 
(in bold type and underlined) derives from the IgG 




sequence for a hybrid 



the invention. 



The predicted intervening sequence 
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sequence and marks the junction between the aSl and IgG 
splice sequences* The 5 1 -end upper case sequence 
depicts the complete exon one of the bovine aSl casein 
gene. The 3 1 -end upper case sequence represents the 
5 splice junction of the IgG gene through to the Pst I 
site (CTGCAG) incorporated in the cloning vector, pMHl. 

Fig. 12A - is a restriction map of a bovine ocSl casein 
*c^pS promoter hLF cDNA transgene. 

Fig. 12B shows a Southern blot analysis of DNA isolated 
10 from various bovine and murine tissues using an hLF cDNA 
probe . 

Fig. 13 depicts restriction maps of hLF genomic clones 
13.1 and 13.2. 

Fig. 14 depicts the BamHI fragment from genomic hLF 
15 subcloned into plasmid pUC19 . 

Fig. 15A depicts a restriction map of the 8hLFgen9k or 
16hLFgen9k construct containing the 8 or 16 kb aSl 
casein promoter, a Clal-Apal synthetic linker and the 
9 kb (i.e., 8.9 kb) Apal-Sall genomic hLF fragment. 

(^^.20 Fig. 15B^ depicts the DNA sequence of the Clal-Apal 
synthetic sequence shown in Fig. 15A. 



Fig. l&^rdepicts the IVS and the structure of exon 1 and 
part of exon 2 of the genomic hLF construct shown in 
Fig. 15A through Fig. 17. 



25 Fig. 16 depicts the coinjection of the Notl-Sall 
fragment from the 8hLFgen9k or 16hLFgen9k construct (as 
shown in Fig. 15A) with the 3' Clal fragment of genomic 
hLF. 



^ig. 17; depicts the g< 
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Fig. 17| depicts the generation of a genomic 8hLF 
transgene by linking the Notl-Mlul fragment from the 
8hLFgen9k construction (shown in Fig, 15A) , the Mlul- 
Clal fragment from clone 13.2 depicted in Fig, 13 and 
a Clal-NotI linker. Fig. 17 also depicts the DNA 
seguence of the Clal-NotI linker. 



igs. 18-20 depict the generatic 




£ Figs. 18j-2 0 depict the generation of the /3LG-hLFgen and 
)8LG-hLFgen3 7 constructs . 

Fig. 21 depicts the design of the 16,8hLZ expression 
10 vector. 

Fig. 22 depicts the design of the 16,8hLZ3 expression 
vector. 

Fig. 2 3A-^£ depict the pathway for the construction of 
plasmid 8hLZ. 



15 Fig. 2 4 depicts a.comparison between the DNA of bovine 
pLG fand sheep pLG . * The top sequence represents the 
bovine sequence. 

Fig. 25p3hows the linker GP 278/279. 
Fig* 2 6 depicts the pl6,8A hLZ3 expression vector. 
20 Pig. 27 depicts the 16, A hLZ3 expression vector. 



Detailed Description of the Invention 

The "non-human mammals" of the invention comprise all 
non-human mammals capable of producing a "transgenic 
non-human mammal" having a "desirable phenotype". Such 
25 mammals include non-human primates, murine species, 
bovine species, canine species, etc. Preferred non- 
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human animals include bovine, porcine and ovine species, 
most preferably bovine species. 

Desirable phenotypes for transgenic non-human mammals 
include, but are not limited to, the production of 
5 recombinant polypeptides in the milk of female 
transgenic non-human mammals, the production of animal 
models for the study of disease, the production of 
animals with higher resistance to disease (e.g. diseases 
of the mammary gland such as mastitis) and the 
10 production of recombinant polypeptides in the blood, 
urine or other suitable body fluid or tissue of the 
animal. In the preferred embodiments, transgenic bovine 
species are disclosed which are capable of producing 
recombinant human lactoferrin, human serum albumin and 
15 human Protein C in the milk of lactating females or 
human serum albumin in the liver of the transgenic 
animal . 

The transgenic non-human mammals of the invention are 
produced by introducing a "transgene" into an embryonal 
target cell of the animal of choice. In one aspect of 
the invention, a transgene is a DNA sequence which is 
capable of producing a desirable phenotype when 
contained in the genome of cells of a transgenic non- 
human mammal. In specific embodiments, the transgene 
comprises a "recombinant DNA sequence" encoding a 
"recombinant polypeptide". In such cases, the transgene 
is capable of being expressed to produce the recombinant 
polypeptide. 

As used herein, a "recombinant polypeptide" (or the 
3 0 recombinant DNA sequence encoding the same) is either 
a "heterologous polypeptide" or a "homologous 
polypeptide". Heterologous polypeptides are 

polypeptides which are not normally produced by the 
transgenic animal. Examples of heterologous 
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polypeptides include human milk proteins such as 
lactoferrin, lysozyme, secreted immunoglobulins, 
lactalbumin, bile salt-stimulated lipase, etc., human 
serum proteins such as albumin, immunoglobulins, Factor 
5 VIII, Factor IX, protein C, etc. and industrial enzymes 
such as proteases, lipases , chitinases, and liginases 
from procaryotic and eucaryotic sources. The 
recombinant DNA sequences include genomic and cDNA 
sequences encoding the recombinant polypeptide. 

10 When recombinant DNA sequences encoding a heterologous 
polypeptide are used, the transgene may be integrated 
in a random manner into the genome of the species used 
for transgenesis . As disclosed in the Examples, 
transgenes encoding human lactoferrin, human serum 

15 albumin and human Protein C in conjunction with a aSl 
casein secretory signal sequence under control of aSl 
casein expression regulation sequences are designed to 
produce and secrete these heterologous polypeptides from 
the mammary gland of a lactating transgenic mammal into 

2 0 its milk. 

As used herein, a homologous polypeptide is one which 
is endogenous to the particular transgenic species. 
Examples of endogenous polypeptides from bovine species 
include bovine milk proteins such as ctSl, aS2 , /?- and 
25 #c-casein, j3-lactoglobulin lactoferrin, lysozyme, 
cholesterol hydrolase, serum proteins such as serum 
albumin and proteinaceous hormones such as growth 
hormones. When recombinant DNA sequences encoding a 
homologous polypeptide are used, the transgene is 

3 0 preferably integrated in a random manner into the genome 

of the species used for transgenesis. Such random 
integration results in a transgenic animal which 
contains not only the transgene encoding the endogenous 
polypeptide but also the corresponding endogenous 
35 genomic DNA sequence. Accordingly, such transgenic non- 
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human mammals are readily characterized by an increase 
in the copy number of genes encoding the endogenous 
polypeptide. Further, the transgene will generally be 
located at a position which is different from the 
endogenous gene. 

When DNA encoding a homologous polypeptide is expressed, 
for example, in bovine species, the transgenic animal 
is characterized by an increase in the amount of the 
homologous polypeptide in either the endogenous tissue 
or fluid in which it is normally found and/or by its 
presence in a tissue and/or body fluid which either does 
not normally contain the homologous polypeptide or 
produces it at significantly lower levels. 

Thus, for example, bovine cholesterol hydrolase is 
normally present in the colostrum for about the first 
15-2 0 days of lactation. This naturally occurring 
endogenous polypeptide increases calf weight. This 
protein, however, is also a homologous polypeptide when, 
for example, its expression in mammary secretory cells 
is placed under the control of expression regulation 
sequences, such as those obtained from bovine casein 
genes, which facilitate the expression of the homologous 
polypeptide beyond the lactation period that it is 
normally present. Thus, according to one aspect of the 
invention, bovine cholesterol hydrolase expression is 
maintained in transgenic bovine milk by placing the 
expression of cholesterol hydrolase recombinant DNA 
(either cDNA or genomic) under the control of bovine aSl 
casein expression regulation sequences. When a genomic 
recombinant DNA is used, it is engineered such that it 
has appropriate restriction sites (e.g. Clal and Sail) 
at the 5 • and 3 1 end of the structural gene such that 
it is capable of being inserted into an appropriate 
transgene genomic cassette (e.g. p-16 kb, CS which is 
described in Example 15) . Alternatively, a recombinant 
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DNA encoding bovine cholesterol hydrolase derived from 
cDNA may be placed under control of bovine aSl casein 
expression regulation sequence by substituting the human 
lactoferrin sequences in a plasmid such as pl6, 8HLF3 
5 (containing a hybrid intervening sequence) or pl6, 8HLF4 
(containing a homologous aSl casein intervening 
sequence) . When these particular plasmids are used, the 
cDNA clone is engineered such that it has appropriate 
Clal and Sail restriction sites at the ends of the 
10 recombinant DNA. 

By way of further example, bovine lactoferrin is 
normally present in only trace amounts in cow's milk. 
When, however, bovine lactoferrin is expressed under 
control of other regulatory sequences, for example, 
obtained from an orSl casein gene, higher amounts of 
lactoferrin in the milk of transgenic bovine species are 
obtained. In another example, a transgene comprising 
DNA encoding homologous bovine growth hormone is 
incorporated into the bovine genome to confer superior 
growth characteristics to the transgenic animal. In 
other instances, homologous polypeptides include, for 
example, a polypeptide which normally is maintained 
intracellularly in a particular species but which is 
secreted into the milk or other extracellular 
compartment of the transgenic species, such as the 
circulatory system . 

Each of the heterologous or homologous polypeptides are 
characterized by specific amino acid and nucleic acid 
sequences. It is to be understood, however, that such 
3 0 sequences include naturally occurring allelic variations 
thereof and variants produced by recombinant methods 
wherein such nucleic acid and polypeptide sequences have 
been modified by the substitution, insertion and/or 
deletion of one or more nucleotides in such nucleic 
35 acids to cause the substitution, insertion or deletion 
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of one ore more amino acid residues in the recombinant 
polypeptide ♦ 

When expression of the DNA of the transgene is necessary 
to generate a desired phenotype, e.g. to produce a 
recombinant polypeptide, the transgene typically 
includes at least a 5' and preferably additional 3* 
"expression regulation sequences" each operably linked 
to a recombinant or secretory-recombinant DNA as defined 
hereinafter. Such expression regulation sequences in 
addition to controlling transcription also contribute 
to RNA stability and processing, at least to the extent 
they are also transcribed. 

Such expression regulation sequences are chosen to 
produce tissue-specific or cell type-specific expression 
of the recombinant or secretory-recombinant DNA. Once 
a tissue or cell type is chosen for expression, 5 1 and 
optional 3 ' expression regulation sequences are chosen. 
Generally, such expression regulation sequences are 
derived from genes that are expressed primarily in the 
tissue or cell type chosen. Preferably, the genes from 
which these expression regulation sequences are obtained 
are expressed substantially only in the tissue or cell 
type chosen, although secondary expression in other 
tissue and/or cell types is acceptable if expression of 
the recombinant DNA in the transgene in such tissue or 
cell type is not detrimental to the transgenic animal. 
Particularly preferred expression regulation sequences 
are those endogenous to the species of animal to be 
manipulated. However, expression regulation sequences 
from other species such as those from human genes may 
also be used. Particularly preferred expression 
regulation sequences from human genes are human 
lactoferrin (hLF) sequences. In some instances, the 
expression regulation sequences and the recombinant DNA 
sequences (either genomic or CDNA) are from the same 



species, e.g., each from bovine species or from a human 
source. In such cases, the expression regulation 
sequence and the recombinant DNA sequence are homologous 
to each other. Alteratively , the expression regulation 
sequences and recombinant DNA sequences (either cDNA or 
genomic) are obtained from different species, e.g., an 
expression regulation sequence from bovine species and 
a recombinant DNA sequence from a human source) . In 
such cases, the expression regulation and recombinant 
DNA sequence are heterologous to each other. The 
following defines expression regulation sequences from 
endogenous genes. Such definitions are also applicable 
to expression regulation sequences from non-endogenous, 
heterologous genes . 

In general, the 5' expression regulation sequence 
includes the transcribed portion of the endogenous gene 
upstream from the translation initiation sequence (the 
5' untranslated region or 5* UTR) and those flanking 
sequences upstream therefrom which comprise a functional 
promoter. As used herein, a "functional promoter" 
includes those necessary untranscribed DNA sequences 
which direct the binding of RNA polymerase to the 
endogenous gene to promote transcription. Such 
sequences typically comprise a TATA sequence or box 
located generally about 25 to 3 0 nucleotides from the 
transcription initiation site. The TATA box is also 
sometimes referred to the proximal signal. In many 
instances, the promoter further comprises one or more 
distal signals located upstream from the proximal signal 
(TATA box) which are necessary to initiate 
transcription. Such promoter sequences are generally 
contained within the first 100 to 200 nucleotides 
located upstream from the transcription initiation site, 
but may extend up to 500 to 600 nucleotides from the 
transcription initiation site. Such sequences are 
either readily apparent to those skilled in the art or 
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readily identifiable by standard methods . Such promoter 
sequences alone or in combination with the 5 1 
untranslated region are referred to herein as "proximal 
5' expression regulation sequences". 

In addition to such proximal 5 1 expression regulation 
sequences, it is preferred that additional 5 1 flanking 
sequences (referred to herein as "distal 5' expression 
regulation sequences") also be included in the 
transgene. Such distal 5' expression regulation 
sequences are believed to contain one or more enhancer 
and/or other sequences which facilitate expression of 
the endogenous gene and as a consequence facilitate the 
expression of the recombinant or secretory-recombinant 
DNA sequence operably linked to the distal and proximal 
5' expression regulation sequences. The amount of 
distal 5' expression regulation sequence depends upon 
the endogenous gene from which the expression regulation 
sequences are derived. In general, however, such 
sequences comprise 5 1 flanking regions of approximately 
1 kb, more preferably 16 kb and most preferably about 
3 0 kb of 5' flanking sequence. The determination of the 
optimal amount of distal 5 1 expression regulation 
sequence used from any particular endogenous gene is 
readily determined by varying the amount of distal 5 1 
expression regulation sec[uence to obtain maximal 
expression. In general, the distal 5' expression 
regulation sequence will not be so large as to extend 
into an adjacent gene and will not include DNA sequences 
which adversely effect the level of transgene 
expression. 

In addition, it is preferred that 3' expression 
regulation sequences also be included to supplement 
tissue or cell-type specific expression. Such 3* 
expression regulation sequences include 3 1 proximal and 
3 1 distal expression regulation sequences from an 
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appropriate endogenous gene* The 3' proximal expression 
regulation sequences include transcribed but 
untranslated DNA positioned downstream from the 
translation stop signal in the recombinant DNA. sequence 
(also referred to as the 3' untranslated region or 3 1 
UTR) • Such sequences generally terminate at a 
polyadenylation sequence (either from the endogenous 
gene or from other sources such as SV4 0) and sequences 
that may affect RNA stability. Generally, 3 1 UTR's 
comprise about 100 to 500 nucleotides downstream from 
the translation stop signal in the gene from which the 
3 1 regulation sequence is derived. Distal 3' expression 
regulation sequences include flanking DNA sequences 
downstream from the proximal 3 1 expression regulation 
sequence. Some of these distal sequences are 

transcribed, but do not form part of the mRNA while 
other sequences in this distal 3 ' expression regulation 
sequence are not transcribed at all. Such distal 3 1 
expression regulation sequences are believed to contain 
enhancer and/ or other sequences which enhance 
expression. Such sequences are believed to be necessary 
for efficient polydenylation and contain transcription 
termination sequences Preferably, such sequences 
comprise about 2 kb, more preferably 8 kb and most 
preferably about 15 kb of 3 • flanking sequence. 

A preferred 3 1 flanking sequence is the 3 f flanking 
sequence of the human lactoferrin (hLF) gene. Transgenic 
animals containing transgenes that include about 9 kb 
of hLF 3 1 flanking sequences show enhanced expression 
of recombinant polypeptides in milk compared to animals 
containing transgenes that include 1 kb or less of hLF 
3 1 flanking sequence, due to an enhancer or other 
enhancing sequence located in this region. Usually the 
human lactoferrin 3 ' flanking sequence will be at least 
1 kb in length up to about 9 kb in length or longer, 
typically 3 to 7 kb, more typically 4 to 5 kb. It will 
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also be possible, and sometimes desirable, to use 
standard methods (e.g., deletion analysis) to identify 
regions contained within the 9 kb 3 • flanking sequence 
that enhance mammary gland expression of recombinant 
5 polypeptides. These enhancers or enhancing sequences can 
be isolated and used in combination with various amounts 
of homologous or heterologous sequences. Typically the 
enhancing sequences can range in length from about 50 
basepairs to about 2 kb, more typically from about 100 
10 basepairs to about 500 basepairs. 

It will often be desirable to use a trahsgene having a 
5 1 expression regulation sequence and a 3 1 flanking 
sequence that originate from the same gene. In a 
preferred embodiment, the 5' expression regulation 
15 sequence and 3 ' flanking sequence are from the bovine 
aSl-casein gene. 

In an alternative embodiment a genomic sequence, such 
as a human genomic clone or clones, can be introduced 
into an animal to produce a transgenic animal containing 
a transgene that has the sequence of the human gene, 
including all or part of the 5 ■ expression regulation 
sequences, coding sequences, introns, and 3' 
untranslated and flanking sequences. In a preferred 
embodiment, the human lactoferrin genomic sequence is 
used in its entirety, but various components can be 
substituted with components from other mammary gland 
specific genes. 

Although the use of both 5 1 and 3 1 expression regulation 
sequences are preferred, in some embodiments of the 
3 0 invention, endogenous 3 1 regulation sequences are not 
used. In such cases, the 3 1 proximal expression 
regulation sequences normally associated with the 
genomic DNA encoded by the recombinant DNA sequence are 
used to direct polyadenylation. In addition, distal 3' 
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regulation sequences from the genomic DNA encoding the 
recombinant polypeptide may also be employed preferably 
in the same amounts as set forth for endogenous 3 1 
expression regulation sequences. In such cases, it is 
to be understood that the recombinant polypeptide 
encoded by the transgene may comprise either genomic DNA 
or a double stranded DNA derived from cDNA. As with the 
5 f expression regulation sequences, the optimal amount 
of 3 • expression regulation sequence may be readily 
determined by varying the amount of 3 ■ flanking sequence 
to obtain maximal expression of the recombinant 
polypeptide. In general, the distal 3' regulation 
sequence, be it from an endogenous gene or a 
heterologous gene, will not extend into the adjacent 
gene from which is derived and will exclude any 
sequences which adversely effect the level of transgene 
expression* 



Examples of expression regulation sequences are provided 
in Table I . 
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Expression Regulation 
Sequence ' 

16 kb of bovine aSl 
casein 5 ■ to structural 
gene and 8 kb 3 • to 
structural gene 

«15 kb 5' to albumin 
gene 

»15 kb 5' to a-actin 
gene 

«15 kb upstream of 
protamine gene 



TABLE 1 

Tissue 
Specificity 

Mammary 
secretory 
cells 



Liver 

Muscle 

Spermatids 



Animal 
Species 

bovine 



murine 



murine 



murine 



In addition to the 5 1 and 3 1 expression regulation 
sequences and the recombinant DNA (either genomic or 




derived from cDNA) the transgenes of the invention 
preferably also comprise a "recombinant intervening 
sequence" which interrupts the transcribed but 
untranslated 5' region of the transgene. Such 
intervening sequences can be derived, for example, from 
bovine aSl casein and from human lactoferrin. Such 
sequences as used herein are "homologous recombinant 
intervening sequences" in that the 5* and 3 1 RNA splice 
signals in such recombinant intervening sequences are 
those normally found in an intervening sequence from an 
endogenous or heterologous gene. Recombinant 
intervening sequences may, however, also comprise a 
"hybrid intervening sequence". Such hybrid intervening 
sequences comprise a 5' RNA splice signal and 3' RNA 
splice signal from intervening sequences from different 
sources. In some aspects of the invention, such hybrid 
intervening sequences comprise at least one "permissive 
RNA splice sequence". As used herein, a permissive RNA 
splice signal is an RNA splice signal sequence, 
preferably a 3« RNA splice signal, from an intron 
contained within a repertoire of germ line DNA segments 
which undergo rearrangement during cell differentiation. 
Examples of such gene repertoires include the 
immunoglobulin super gene family, including the 
immunoglobulins and T-cell antigen receptors as well as 
the repertoire of the major histocompatibility complex 
(MHC) genes and others. Particularly preferred 
permissive splice sequences are those obtained from the 
immunoglobulin repertoire, preferably of the IgG class, 
and more preferably those 3 1 splice signal sequences 
associated with the J-C segment rearrangement of the Ig 
heavy and light chain, most preferably the heavy chain. 
A particularly preferred permissive splice sequence 
comprises that portion of the sequence as shown 
downstream of the Hindlll site in Fig. 11. A 
particularly preferred hybrid intervening sequence 
comprises the entire sequence shown in Fig. 11 which 
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includes a 5 1 portion of an intervening sequence from 
bovine aSl casein and a 3 ' sequence portion of an IgG 
heavy chain intervening sequence. 

Such hybrid intervening sequences containing permissive 
5 RNA splice signals are preferably used when the 
recombinant DNA corresponds to a cDNA sequence. As 
indicated in the Examples, when 16 kb of 5 1 expression 
regulation sequence from the orSl casein gene was used 
in conjunction with an aSl casein-IgG hybrid intervening 

10 sequence to express human lactoferrin cDNA operably 
linked to the aSl casein secretory signal sequence a 
transgenic mouse was obtained which produced 
approximately 1330 /ig/ml of hLF in the transgenic milk. 
This amount of recombinant polypeptide far exceeds the 

15 previously reported amounts for production of various 
protein in transgenic mouse milk of generally less than 
10 /xg/ml and in one case approximately 50 /ig/ml. It also 
exceeds the maximum of 8/xg/ml of hLF produced herein 
when the same transgene was used that contained a 

2 0 homologous bovine intervening sequence rather than the 

hybrid intervening sequence. 

However, such hybrid intervening sequences are not 
limited to transgenes utilizing cDNA sequence. Rather, 
hybrid intervening sequences are also useful when the 
25 recombinant polypeptide is encoded by a genomic 
sequence. Based on the results obtained with the cDNA 
recombinant DNA and the general expectation that genomic 
DNA sequences express at higher levels than sequences 
derived from cDNA, it is expected that such hybrid 

3 0 intervening sequences used in conjunction with genomic 

recombinant DNA will further enhance expression levels 
above that which would otherwise be obtained with 
genomic sequence alone. 
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Based on the foregoing, it is apparent that preferred 
transgenes include large amounts of 5 1 and 3 • expression 
regulation sequences. Further, the recombinant DNA is 
preferably derived from genomic clones which may be tens 
to hundreds of kilobases in length. Based on the 
present technology for cloning and manipulating DNA, the 
construction and microinjection of transgenes is 
practically limited to linearized DNA having a length 
not greater than about 50 kb. However, the transgenes 
of the invention, especially those having a length 
greater than about 50 kb, may be readily generated by 
introducing two or more overlapping fragments of the 
desired transgene into an embryonal target cell. When 
so introduced, the overlapping fragments undergo 
homologous recombination which results in integration 
of the fully reconstituted transgene in the genome of 
the target cell* In general, it is preferred that such 
overlapping transgene fragments have 100% homology in 
those regions which overlap. However, lower sequence 
homology may be tolerated provided efficient homologous 
recombination occurs. If non-homology does exist 
between the homologous sequence portions, it is 
preferred that the non-homology not be spread throughout 
the homologous sequence portion but rather be located 
in discrete areas. Although as few as 14 base pairs at 
100% homology are sufficient for homologous 
recombination in mammalian cells (Rubnitz, J. and 
Subramani, S. (1984) Mol. Cell. Biol. 4:2253-2258), 
longer homologous sequence portions are preferred, e.g. 
500bp, more preferably lOOObp, next most preferably 
2000bp and most preferably greater than 2 000bp for each 
homologous sequence portion. 

As indicated in the examples, three overlapping 
fragments of the human serum albumin gene were 
microinjected into the pronuclei of mouse zygotes in 
approximately equal molar portions. These fragments 




successfully recombined and integrated into the mouse 
genome as confirmed by analysis of the integrated DNA 
by Southern blotting procedures and by detection of RNA 
transcript and human serum albumin in the serum of the 
transgenic mouse. Although the transgene so generated 
has a unit length of 38 kb., there is no known practical 
limit to the size of the transgene which may be formed 
using larger and/or greater numbers of overlapping 
transgene fragments. In particular, it is expected that 
transgenes may be formed by this approach having lengths 
between about 50 to 1000 kb and more preferably between 
50 and 500 kb. Further , the use of homologous 
recombination of overlapping fragments is expected to 
be fruitful in the generation of larger transgenic 
animals, such as transgenic bovine species, containing 
transgenes incorporating recombinant DNA comprising 
genomic DNA which otherwise could not be incorporated 
into a pronucleus to form a transgenic animal. Such 
genomic transgenes are expected to produce higher 
expression levels in transgenic cows as compared to that 
which is produced by transgenes encoding recombinant 
cDNA. 

When, the ultimate object is to secrete a recombinant 
polypeptide, a "secretory DNA sequence" encoding a 
functional secretion signal peptide is also operably 
linked within the transgene to direct secretion of the 
recombinant polypeptide from one or more cell types 
within the transgenic animal. Secretory DNA sequences 
in general are derived from genes encoding secreted 
proteins of the same species of the transgenic animal. 
Such secretory DNA sequences are preferably derived from 
genes encoding polypeptides secreted from the cell type 
targeted for tissue-specific expression, e.g. secreted 
milk proteins for expression in and secretion from 
mammary secretory cells. Secretory DNA sequences, 
however, are not limited to such sequences. Secretory 
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DNA sequences from proteins secreted from other cell 
types within the species of transgenic animal may also 
be . used, e.g., the native signal sequence of a 
homologous gene encoding a protein secreted other than 
5 in the mammary glands. In addition, "heterologous 
secretory DNA sequences" which encode signal secretion 
peptides from species other than the transgenic animals 
my also be used e.g., human t-PA, human serum albumin 
human lactoferrin and human lactalbumin and secretion 

10 signals from microbial genes encoding secreted 
polypeptides such as from yeast, filamentous fungi, and 
bacteria. In general, a secretory DNA sequence may be 
defined functionally as any DNA sequence which when 
operably linked to a recombinant DNA sequence encodes 

15 a signal peptide which is capable of causing the 
secretion of the recombinant polypeptide. 

In one of the preferred embodiments, a secretory DNA 
sequence encoding a secretory signal sequence functional 
in the mammary secretory cells of bovine species is used 

2 0 to cause secretion of recombinant polypeptide from 

bovine mammary secretory cells. The secretory DNA 
sequence is operably linked to the recombinant DNA 
sequence. Examples of such secretory DNA sequences 
include DNA sequences encoding signal secretion 
25 sequences for bovine aSl casein, murine lactoferrin and 
human transferrin. The preferred secretory DNA sequence 
is that encoding the secretory sequence of crSl casein 
from bovine species. The use of this secretory DNA 
sequence is described in more detail in the Examples. 

3 0 "Operably linked" in the context of linking a secretory 

DNA sequence to a recombinant DNA sequence means that 
the secretory DNA sequence (comprising codons encoding 
the secretory signal peptide sequence) is covalently 
coupled to the recombinant DNA sequence so that the 
3 5 resultant secretory-recombinant DNA sequence encodes 5' 
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to 3 ' for the secretory signal sequence and recombinant 
polypeptide. Accordingly, the reading frame for the 
secretory sequence and the recombinant DNA sequence must 
be covalently combined such that an open reading frame 
5 exists from the 5 1 end of the mRNA sequence formed after 
transcription and processing of the primary RNA 
transcript. This open reading frame in the RNA contains 
a 5 1 sequence portion encoding the secretory signal 
peptide and a 3 1 sequence portion encoding the 

10 recombinant polypeptide. When so constructed, the 
recombinant polypeptide produced upon expression of the 
secretory-recombinant DNA sequence is of a form which 
is capable of being secreted from targeted cells which 
express the DNA sequence. The signal peptide generally 

15 is removed in vivo during secretion to produce an 
extracellular form of the recombinant polypeptide. 

In the preferred embodiments of the invention, a 
secretory-recombinant DNA sequence is expressed 
predominantly in the mammary secretory cells of 

2 0 transgenic bovine species. Such tissue-specific 

expression is obtained by operably linking mammary 
specific expression regulation DNA sequences to the 
above secretory-recombinant DNA sequence. Such mammary 
specific regulation sequences include the aforementioned 
25 regulation sequences contained in various bovine genes 
preferentially expressed in the mammary secretory cells 
of the species. Such mammary specific genes include aSl 
casein; aS2-casein; jS-casein; K-casein; a-lactalbumin; 
and /3-lactoglobulin. Preferred expression regulation 

3 0 sequences are derived from aSl casein as described more 

in detail in the Examples. 

In general, the transgenes of the invention that are 
designed to secrete the recombinant polypeptide into 
transgenic bovine milk are capable of causing such 
35 secretion at levels significantly higher than that 



previously reported for transgenic mice and sheep. When 
the recombinant polypeptide is encoded by a recombinant 
DNA corresponding to, or derived from, cDNA, the molar 
concentration of the recombinant polypeptide is 
preferably greater than about 1.0 /iM, more preferably 
greater than about 100 jiM, and most preferably greater 
than 100 /iM. When viewed from the perspective of the 
level of recombinant polypeptide present in the 
transgenic milk, the amount of recombinant polypeptide 
is preferably greater than 50 tig/ml, more preferably 
greater than about 500 /ig/ml and most preferably greater 
than about 1000 Mg/ml (lmg/ml) . 

When the transgene of the invention encodes a 
recombinant polypeptide that is encoded by recombinant 
DNA derived from or corresponding to genomic DNA (or 
comprised substantially of such genomic sequences, e.g. 
greater than about 50%, more preferably greater than 
about 75%, most preferably greater than 90% of the 
codons encoding the recombinant polypeptide are from 
genomic sequences) , the molar concentrations and protein 
levels in bovine transgenic milk are the same as for 
cDNA or higher. In general, the molar concentration 
of the recombinant polypeptide in such transgenic milk 
is preferably greater than about 50 jiM, more preferably 
greater than about 150 mM, most preferably greater than 
about 500 /iM. When viewed from the level of protein in 
the transgenic milk, the levels are preferably greater 
than about 10 mg/ml, more preferably greater than about 
2.5 mg/ml, most preferably greater than 5 mg/ml. 

The foregoing molar concentration and protein levels in 
bovine transgenic milk will vary depending upon the 
molecular weight of the particular recombinant 
polypeptide. A particular advantage of producing a 
recombinant polypeptide in bovine transgenic milk is 
that relatively large molecular weight polypeptides may 
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be so produced which are otherwise difficult to produce 
in large quantities in other systems such as prokaryotic 
expression systems. Although any recombinant 

polypeptide may be produced in bovine transgenic milk 
5 according to the invention, it is generally preferred 
that such recombinant polypeptides have a molecular 
weight greater than about 10,000 Daltons. However, 
other recombinant polypeptides having molecular weights 
of greater than 15,000, greater than 2 0,000 and greater 

10 than 60,000 Daltons may also be expressed in transgenic 
bovine milk. For example, human lysozyme having a 
molecular weight of 17,000 Daltons and lactoferrin 
having a molecular weight of 79,000 Daltons may be 
readily produced in the transgenic milk of bovine 

15 species according to the disclosure of the invention. 
Thus, the recombinant polypeptides of the invention have 
a wide range of molecular weights. 

As a consequence, the foregoing preferred molar 
concentrations of recombinant polypeptides are adjusted 

20 when higher molecular weight recombinant polypeptides 
are produced. Such adjustment is made by converting the 
molar concentration to the amount of protein produced 
and adjusting the molar concentrations so that the 
recombinant protein level is within the following 

25 preferred concentrations. 

Most of the previous reports relating to the production 
of polypeptides in transgenic milk involve transgenic 
mice* The mouse, however, normally produces between 55 
to 80 milligrams of protein per ml of milk. A cow, on 

3 0 the other hand, normally produces between 3 0 to 34 
milligrams of protein per ml. Since exceptionally high 
levels of recombinant polypeptide production may 
adversely affect the production of endogenous milk 
protein and/or have adverse effects upon the mammary 

35 secretory gland, it is preferred that the recombinant 
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polypeptide concentration be between about 3 and 50% of 
the normal bovine milk protein concentration (i.e. , 
between about 1 and 17 milligrams of recombinant 
polypeptide per ml of transgenic milk) , more preferably 
between 10 to 20% (i.e., between 3 to about 7 milligrams 
per ml) and most preferably between 10 and 15% (i. e. , 
between about 3 and 5 milligrams per ml) of the normal 
amount of protein produced in bovine milk. Such 
preferred ranges also provide a preferred maximum limit 
to the aforementioned levels of protein produced in 
transgenic bovine milk. 

The above described linking of various DNA sequences to 
form the transgene of the invention are performed by 
standard methods known to those skilled in the art or 
as described herein. Once the transgene or overlapping 
homologous fragments encoding the transgene are 
constructed as described they are used to make 
transgenic non-human animals. 

Methods of introducing transgenes or overlapping 
transgene fragments into embryonal target cells include 
microinjection of the transgene into the pronuclei of 
fertilized oocytes or nuclei of ES cells of the non- 
human animal. Such methods for murine species are well 
known to those skilled in the art. Alternatively, the 
transgene may be introduced into an animal by infection 
of zygotes with a retrovirus containing the transgene 
(Jaenisch, R. (1976) Proc. Natl. Acad. Sci. USA 73 :1260- 
12 64) . The preferred method is microinjection of the 
fertilized oocyte. In this preferred embodiment, the 
fertilized oocytes are first microinjected by standard 
techniques. They are thereafter cultured in vitro until 
a "pre-implantation embryo" is obtained. Such pre- 
implantation embryos preferably contain approximately 
16 to 150 cells. The 16 to 32 cell stage of an embryo 
is commonly referred to as a morula. Those pre- 
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implantation embryos containing more than 32 cells are 
commonly referred to as blastocysts. They are generally 
characterized as demonstrating the development of a 
blastocoel cavity typically at the 64 cell stage* 
5 Methods for culturing fertilized oocytes to the pre- 
implantation stage include those described by Gordon, 
et al. (1984) Methods in Enzvmoloav 101 : 414 : Hogan, et 
al. (1986) in Manipulating the Mouse Embryo , Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N. Y. (for 

10 the mouse embryo); and Hammer, et al. (1985) Nature 
315: 680 (for rabbit and porcine embryos) Gandolfi, et 
al. (1987) J. Reprod. Fert. 81:23-28; Rexroad, et al. 
(1988) J - Anim. Sci. 66:947-953 (for ovine embryos) and 
Eyestone, W.H. et al. (1989) J". Reprod. Fert. 85:715- 

15 720; Camous., et al. (1984) J. Reprod. Fert. 72 :779-785; 
and Heyman, Y . , et al. (1987) Therioaenoloav 27:5968 
(for bovine embryos) . Such pre-implantation embryos are 
thereafter transferred to an appropriate female by 
standard methods to permit the birth of a transgenic or 

2 0 chimeric animal depending upon the stage of development 

when the transgene is introduced. As is well known, 
mosaic animals can be bred to form true germline 
transgenic animals. 

Since the frequency of transgene incorporation is often 
25 low, the detection of transgene integration in the pre- 
implantation embryo is highly desirable. In one aspect 
of the invention methods are provided for identifying 
embryos wherein transgenesis has occurred and which 
permit implantation of transgenic embryos to form 

3 0 transgenic animals. In this method, one or more cells 

are removed from the pre-implantation embryo. When 
equal division is used, the embryo is preferably not 
cultivated past the morula stage (32 cells) . Division 
of the pre-implantation embryo (reviewed by Williams et 
35 al. (1986) Theriocr enoloav 22:521-531) results in two 
"hemi-embryos" (hemi-morula or hemi-blastocyst) one of 
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which is capable of subsequent development after 
implantation into the appropriate female to develop in 
utero to term. Although equal division of the pre- 



5 that such an embryo may be unequally divided either 
intentionally or unintentionally into two hemi-embryos 
which are not necessarily of equal cell number. 
Essentially, all that is required is that one of the 
embryos which is not analyzed as hereinafter described 
10 be of sufficient cell number to develop to full term in 
utero . In a specific embodiment, the hemi-embryo which 
is not analyzed as described herein, if shown to be 
transgenic, is used to generate a clonal population of 
transgenic non-human animals. 

15 One of each of the hemi-embryos formed by division of 
pre-implantation embryos is analyzed to determine if the 
. transgene has been integrated into the genome of the 
organism. Each of the other hemi-embryos is maintained 
for subsequent implantation into a recipient female of 

20 the species. A preferred method for detecting 
transgenesis at this early stage in the embryo's 
development uses these hemi-embryos in connection with 
a unique property of the restriction endonuclease Dpn 
1. This enzyme recognizes the sequence GATC in double- 

25 stranded DNA but only when the adenine in each strand 
within this sequence is methylated at N-6 . When using 
this preferred method, the transgene containing the 
sequence GATC is methylated . prior to microinjection 
either by transferring the transgene on an appropriate 

3 0 plasmid through a DAM + strain of microorganisms such as 
EU. coli MM294 or by directly methylating the transgene 
with dam methylase. The methylated transgene 

(preferably without any exogenous sequences such as 
plasmid vector) is then microinjected into fertilized 

3 5 oocytes (approximately 10 to 500 copies per pronucleus, 
more preferably 50 to 100 copies per pronucleus) • The 



implantation embryo is preferred, it is to be understood 
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f ertilized oocytes so obtained are cultured in vitro to 
the pre-implantation stage. During this early growth 
and cell division phase, the genomic DNA is replicated. 
Accordingly, those copies of the methylated transgene 
5 integrated into the genome of the fertilized oocyte are 
unmethylated after replication whereas any non- 
integrated transgenes which may still exist after 
replication will remain methylated. (Lacks, S., et al. 
(1977) J. Mol. Biol. 114:153.) This differential 
10 methylation pattern for integrated versus non- integrated 
transgene permits the identification of which fertilized 
oocytes have integrated the transgene into the genome. 

The identification of the pre-implantation embryos 
containing the integrated transgene is achieved by 

15 analyzing the DNA from each of the hemi-embryos . Such 
DNA is typically obtained by lysing the hemi-embryo and 
analyzing the thus released DNA after treatment as 
described by Ninomiy, T. et al. (1989) Molecular 
Reproduction and Development 1:242-248. Each of the DNA 

20 samples is treated with D pn I . Thereafter, a polymerase 
chain reaction (Saiki, et al. (1985) Science 230:1350- 
1354) is preformed to amplify all or part of the 
transgene. When the entire transgene is amplified, two 
extension primers each complimentary to opposite strands 

25 at opposing ends of the transgene are used for 
amplification. When, however, less than the entire 
transgene is amplified, such extension primers are 
chosen such that the amplified gene product spans the 
DPn I site in the transgene « If Dpn I cleavage has not 

3 0 occurred, PGR amplification results in amplified 
sequences having a predetermined size whereas primer 
extension for those transgenes which have been cleaved 
will not result in exponential amplification. 
Generally, the Dpn I/PCR amplified DNA from the hemi- 

3 5 embryo is subjected to electrophoresis followed by 
hybridization with labeled probe complimentary to the 
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region of the transgene between the two extension 
primers. This facilities the determination of the size 
of the amplified DNA sequences, if any, and provides an 
indication of whether the transgene has been integrated 
5 into the pre-implantation embryo from which the hemi- 
embryo was obtained (now called a "transgenic hemi- 
embryo" ) . If it has, the remaining untreated transgenic 
hemi-embryo is transplanted into a recipient parent. 
After in utero development, the transgenic non-human 

10 animal having the desired phenotype conferred by the 
integrated transgene is identified by an appropriate 
method in utero or after birth. Of course, other 
restriction endonucleases capable of cleaving a 
methylated DNA sequence but incapable of cleaving the 

15 unmethylated form of a recognition sequence may be used 
in the aforementioned method. 

The above described method using Dpn I requires that the 
sequence GATC be present in the transgene of interest. 
In those cases when such a sequence is not present, it 

2 0 may be readily introduced into the transgene by site 

directed mutagenesis (Kunkel, T. A. (1985) Proc. Natl. 
Acad. Sci. 82:48fl) or cassette mutagenesis (Wells, J. A. , 
et al. (1985) Gene 3_4:315) provided such mutagenesis 
does not change the amino acid sequence encoded by the 
25 transgene (or causes an inconsequential change in amino 
acid sequence) and that any codons so generated are 
functional in the transgenic non-human animal of 
interest. 

The above described methods for the detection of 

3 0 transgenesis in pre-implantation embryos provide 

economical and time saving method for generating 
transgenic non-human animals since they significantly 
decrease the number of pregnancies required to produce 
a transgenic animal and substantially increase the 
3 5 likelihood that an implanted embryo will produce a 
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transgenic non-human animal. Such methods are 

especially important for those animals for which very 
low or non-existent frequencies of transgenesis have 
been obtained, e.g. bovine species. 

In an alternate embodiment, the above described method 
for detecting transgenesis in pre-implantation embryos 
is combined with embryonic cloning steps to generate a 
clonal population of transgenic embryos which may 
thereafter be implanted into recipient females to 
produce a clonal population of transgenic non-human 
animals also having the same genotype. In this regard, 
it is to be understood that transgenic embryos and/ or 
non-human transgenic animals having the same "genotype" 
means that the genomic DNA is substantially identical 
between the individuals of the embryo and/ or transgenic 
animal population. It is to be understood, however, 
that during mitosis various somatic mutations may occur 
which may produce variations in the genotype of one or 
more cells and/or animals. Thus, a population having 
the same genotype may demonstrate individual or 
subpopulation variations. 

After a hemi-embryo is identified as a transgenic hemi- 
embryo, it is cloned. Such embryo cloning may be 
performed by several different approaches. In one 
25 cloning method, the transgenic hemi-embryo is cultured 
in the same or in a similar media as used to culture 
individual oocytes to the pre-implantation stage. The 
"transgenic embryo" so formed (preferably a transgenic 
morula) is then divided into "transgenic hemi-embryos" 
3 0 which can then be implanted into a recipient female to 
form a clonal population of two transgenic non-human 
animals. Alternatively, the two transgenic hemi-embryos 
obtained may be again cultivated to the pre-implantation 
stage, divided, . and recultivated to the transgenic 
35 embryo stage. This procedure is repeated until the 
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desired number of clonal transgenic embryos having the 
same genotype are obtained. Such transgenic embryos may 
then be implanted into recipient females to produce a 
clonal population of transgenic non-human animals. 

5 In a preferred cloning method , the transgenic embryo is 
cloned by nuclear transfer according to the techniques 
of Prather, etal. (1988) Biol. Reprod. 37 :59-86; Roble, 
et al. (1987) J . Anim. Sci. .64.: 642-664. According to 
this method, nuclei of the transgenic embryo are 

10 transplanted into enucleated oocytes, each of which is 
thereafter cultured to the blastocyst stage. At this 
point, the transgenic embryos may be resubjected to 
another round of cloning by nuclear transplantation or 
may be transferred to a recipient parent for production 

15 of transgenic offspring having the same genotype. 

In addition to the foregoing methods for detecting early 
transgenesis , other methods may be used to detect 
transgenesis . Such methods include in utero and post 
partum analysis of tissue. In utero analysis is 

2 0 performed by several techniques. In one, transvaginal 

puncture of the amniotic cavity is, performed under 
echoscopic guidance (Bowgso, et al. (1975) Bet. Res. 
96:124-127; Rumsey, et al. (1974) J. Anim. Sci. 
39.: 386-391) . This involves recovering about 15 to 20 
25 milliliters of amniotic fluid between about day 3 5 and 
day 100 of gestation. This volume of amniotic fluid 
contains about 1000 to 12,000 cells per ml originating 
from the urogenital tract, the skin and possibly the 
lungs of the developing embryo. Most of these cells are 

3 0 dead. Such cells, however, contain genomic DNA which 

is subjected to PCR analysis for the transgene as an 
indication of a successful transgenesis. Alternatively, 
fetal cells may be recovered by chorion puncture. This 
method also may be performed transvaginally and under 
3 5 echoscopic guidance. In this method, a needle is used 




-38- 



10 



15 



20 



25 



to puncture the recipient animal's placenta , 
particularly the placentonal structures, which are fixed 



performed around day 60 of gestation in bovine species. 
Chorion cells, if necessary, are separated from maternal 
tissue and subjected to PCR analysis for the transgene 
as an indication of successful transgenesis . 

Transgenesis may also be detected after birth. In such 
cases, transgene integration can be detected by taking 
an appropriate tissue biopsy such as from the ear or 
tail of the putative transgenic animal. About one to 
two centimeters of tail or about five to ten square 
millimeters of ear are obtained followed by southern 
blotting with a probe for the transgene according to the 
method of Hogan, et al. (19 86) Manipulating the Mouse 
Embrvo . Cold Spring Harbor Laboratory. 

Transgenesis can also be determined by using the 
southern blot technique with DNA obtained from other 
tissues. In particular, semen from a recombinant bull 
will be useful for identifying transgenic animals. 

Transgenesis may also by detected by assaying for 
expression of the recombinant polypeptide in a tissue, 
secretion (e.g. , saliva) , or other body fluid. In the 
case where the goal is expression of a recombinant 
polypeptide in milk of cows it will be especially useful 
to assay the saliva of bulls for expression levels. This 
is because some mammary specific promoters may also 
cause salivary giand expression, albeit at low levels. 
See, e.g., Archibald et al . (1990) Proc. Nat. Acad. Sci. 
USA 87Z:5178-5182. 

In those embodiments where a recombinant polypeptide is 
expressed and secreted into the milk of transgenic 
bovine species, the transgenic milk so obtained may be 



against the vaginal wall. 



Such sampling may be 




either used as is or further treated to purify the 
recombinant polypeptide. This depends, in part, on the 
recombinant polypeptide contained in the transgenic milk 
and the ultimate use for that protein. Thus, when the 
recombinant polypeptide is secreted into transgenic milk 
to increase the nutritional value of the bovine milk, 
no further purification is generally necessary. An 
example of such a situation involves one of the 
preferred embodiments wherein human lactoferrin is 
produced in the milk of bovine species as a supplement 
to control intestinal tract infections in newborn human 
infants and to improve iron absorption. In other 
situations, a partial purification may be desired to 
isolate a particular recombinant polypeptide for its 
nutritional value. Thus, for example, human lactoferrin 
produced in transgenic bovine milk may be partially 
purified by acidifying the milk to about pH 4-5 to 
precipitate caseins. The soluble fraction (the whey) 
contains the human lactoferrin which is partially 
purified. 

The recombinant polypeptide contained in bovine 
transgenic milk may also be used in food f ormulations . 
A particularly useful food formulation comprises an 
infant formula containing one or more recombinant 
polypeptides from transgenic bovine milk which have 
either nutritional or other beneficial value. For 
example, an infant formula containing human lactoferrin 
from transgenic bovine milk made according to the 
present invention provides a bacteriostatic effect which 
aids in controlling diarrhea in newborn. Similarly, 
recombinant polypeptides such as human casein and human 
lysozyme may also be generated in transgenic bovine milk 
to provide nutritional value. Table 2 sets forth the 
constituents of a typical infant formula. As indicated 
therein, the protein content varies between about 1.8 
and 4.5 grams of protein per 100 kilocalories of 
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formula. Thus, the total protein including recombinant 
polypeptide should lie between the values at least based 
on regulatory requirements in the United States from 
which the formulation in Table 2 is based. The amount 
5 of total protein including recombinant polypeptide , of 
course, may vary from the foregoing depending upon the 
local regulations where the particular formula is 



intended to be used. 




Nutri ent 



Mini mem 



Maximum*^ 



Protein (gm> 



1.8 b 



4.5 



Fat: 
gm 

percent cal 



3.3 
30.0 



6.0 
54.0 



Essential fatty acids (linoleate): 
percent cal 
mg 



2.7 
300.0 



Vitamins: 
(A) <IU) 
D (IU> 
K (M9) 
E (IU) 

C (ascorbic acid (mg) 
8, (thiamine C/ig) 
B 2 (riboflavin) (;ig) 
B 4 (pyridoxine) (/ig) 

B, 2 (M9) 
Niacin (;tg) 
Folic acid (/ig) 
Pantothenic acid ( g) 
Biotin (^g) 
Choline (mg) 
Inositol (mg) 



250.0 ( 75 /tg) c 
40.0 
4.0 

0.7 (with 0.7 IU/gm 
lineoleic acid) 

a.o 

40.0 
60.0 

35.0 (with 15 jig/gm of 

protein in formula) 

0.15 
250.0 

4.0 
300.0 

1.5' 

7.0 d 
1 4.0< 



750.0 ( 225 fig) c 
100.0 



Minerals: 

Calcium (mg) 
Phosphorus (mg) 
Magnesium (mg) 
Iron (mg) 
Iodine (jig) 
Zinc (mg) 
Copper (M9> 
Manganese (MQ) 
Sodium (mg) 
Potassium (mg) 
Chloride (mg) 



so.d* e 

25.0^ 

6.0 

0.15 

5.0 

0.5 
60.0 

5.0 
20.0 

ao.o 

55.0 



60.0 
200.0 
150.0 



"Stated per 100 ki localor ies. 

"The source of protein shall be at least nutritionally equivalent to casein. 
"Ret i no I equivalents. 

'Requi red to be included in this amount only in formulas which are not milk-based. 
^Calcium to phosphorus ratio must be no less than 1.1 nor more than 2.0. 
includes recombinant protein according to the invention or recombinant proteins and 
other proteins. 
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In addition to infant formulas, other food formulations 
may also be supplemented with recombinant polypeptides 



recombinant polypeptides may be used to supplement 
common diet formulations. 

When the recombinant polypeptide is intended to be used 
pharmaceutically , purification methods consistent with 
such an application are called for. Such purification 
methods will depend on the particular recombinant 
polypeptide to be purified and are generally known to 
those skilled in the art. Such methods typically 
include a partial purification by casein fractionation 
followed by chromotography of the appropriate fraction 
containing the recombinant polypeptide. Such 
chromotography includes affinity chromotography, ion 
exchange chromotography, gel filtration and HPLC. 

In a specific embodiment of the invention, transgenes 
are provided for producing human lactof errin in the milk 
of transgenic bovine species. Human lactof errin (HLF) 
is a single chain glycoprotein which binds two ferric 
ions. Secreted by exocrine glands (Mason, et al. (1978) 
J. Clin. Path, 31:316-327; Tenovuo, et al. (1986) 
Infect. Immun, 51:49-53) and polymorphonuclear 
neutrophil granulocytes (Mason, et al. (1969) J. Exp. 
Med. r3_0: 643-658) , this protein functions as part of a 
host non-specific defense system by inhibiting the 
growth of a diverse spectrum ^of bacteria. HLF exhibits 
a bacteriostatic effect by chelation of the available 
iron in the media, making this essential metal 
inaccessible to the invading microorganisms (Bullen, et 
al. (1972) Br. Med. J. 1:69-75: Griffiths, et al. (1977) 
Infect. Immun, 15:396-401; Spik, et al. (1978) 
Immunology 8:663-671; Stuart, et al. (1984) Int. J. 
Biochem. 16:1043-1947) . This effect is blocked if the 
protein is saturated with ferric ions. Several studies 



from transgenic bovine milk. 



For example, such 
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suggest that HLF displays a direct bacteriocidal effect 
on certain microorganisms (Arnold, etal. (1980) Infect . 
Immun. 2.8: 893-898; Arnold, et al. (1977) Science 
197:263-265; Arnold, et al. (1981) Infect. Immun. 
5 32.: 655-660; Arnold, et al. (1982) Infect. Immun. 
15:792-797; Bortner, et al. (1986) Infect. Immun. 
5_1: 373-377) . The bacteriocidal effect is also inhibited 
by iron saturation of the protein. No mechanism for the 
bactericidal effect of HLF has been postulated, although 
10 it has been demonstrated that it can damage the outer 
membrane and alter outer membrane permeability in gram- 
negative bacteria (Ellison, etal. (1988) Infect. Immun. 
56 : 2774-2781) . 

Lactoferrin is the major iron binding protein in human 

15 milk (present at a concentration of about 1.5-1.7 mg/ml) 
and may play a role in the absorption of iron by the 
small intestine. All of the iron present in breast milk 
is thought to be bound to hLF and is taken up at very 
high efficiencies compared to formula (Hide, D.W., et 

20 al. (1981) Arch. Pis. Child. 56:172). It has been 
postulated that the high uptake of the hLF bound iron 
is due to a receptor in the jejunum and data has been 
presented suggesting existence of receptors in Rhesus 
monkeys (Cox, etal. (1979) BBA 588:120; Davidson, L.A. , 

25 et al. (1985) Fed. Proc. 18:901) . There is also 
evidence for specific lactoferrin receptors on mucosal 
cells of the small intestine of human adults (Cox, et 
al. (1979) Biochem. Bioohvs. Acta. 588 :120-128). Free 
iron levels have been implicated in the control of the 

30 intestinal flora (Mevissen-Verhage, et al. (1985) Eur. 
J. Clin. Microbiol. A: id) . Breast fed infants, compared 
with infants fed cow's milk, with and without added 
iron, were shown to have substantially reduced coliform 
and, elevated bifidobacteria and Clostridia counts in 

35 fecal samples. In in vitro studies, human milk has been 
shown to have a specific inhibitory effect on coli 
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(Brock, et al. (1983) Infect, and Immunit. 40: 453) . 
Human milk has also been shown to have a specific 
inhibitory effect on coli in small intestine due to 
its high content of iron binding protein, predominantly 
5 hLF (Bullen, et al. (1972) British Med. J. i:69). 

Thus, the production of human lactoferrin in the milk 
of transgenic bovine species provides a source of human 
lactoferrin. Such lactoferrin may be purified from the 
transgenic milk for formulation purposes, 
10 Alternatively, the whole transgenic milk may be used, 
preferably after pasteurization, in either liquid or 
dried form. In addition, the beneficial action of human 
lactoferrin may be potentiated by combining the human 
lactoferrin or the transgenic milk containing it with 
15 human lysozyme. The human lysozyme may be 

simultaneously produced in the transgenic cow by 
introducing a second transgene simultaneously with the 
HLF transgene to produce a transgenic cow capable of 
producing more than one recombinant polypeptide in the 
20 transgenic milk. Alternatively, the transgenes may be 
sequentially introduced into bovine species. When such 
is the case, a transgenic bovine species is obtained 
containing one of the transgenes. Thereafter, embryonic 
cells, such as eggs, are obtained from the transgenic 
25 female and treated so as to incorporate the second 
transgene encoding the second polypeptide. Preferably, 
the egg is fertilized, followed by microinjection of the 
pronucleus of the zygote^ so obtained. It is to be 
understood that the foregoing combination of more than 
3 0 two recombinant polypeptides in transgenic bovine milk 
is not limited to the aforementioned human lactoferrin 
and lysozyme combination. Thus, the invention 

contemplates the production of transgenic bovine species 
and transgenic milk wherein more than one recombinant 
3 5 polypeptide is produced by such a transgenic animal in 
the transgenic milk. 
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Th e complete amino acid sequence of HLF has been 
determined (Metz-Boutigue et al. (1984) Eur. J. Biochem. 
1451 : 659-676) • HLF comprises two domains, each 
containing one iron-binding site and one N-linked 
glycosylation site. These domains show homology between 
each other, indicative of an ancestral gene duplication 
and fusion event. In addition, HLF shares extensive 
homology with other members of the transferrin family 
(Metz-Boutigue, su pra ; Penteicost, et al. (1987) J. Biol. 
Chem. 262 :10134-10139) . Location of the amino acids 
involved in the iron-binding sites has been determined 
by X-ray crystallography (Anderson et al. (1987) Proc. 
Natl. Acad. Sci. 84 :17 69-1773) . A partial cDNA sequence 
for neutrophil HLF was published by Rado, et al. (1987) 
Blood TO: 989-993. There was a >98% agreement between 
the amino acid sequence deduced from the cDNA and that 
which was determined by direct analysis of lactoferrin 
from human milk. The structure of the iron.-saturated 
and iron-free form of human lactoferrin have recently 
been published. (Anderson, et al., (1989) J. Mol. Biol . 
209:711-734; Anderson, et al. ( 1990) Nature : 784-787 . ) 

As used herein, "human lactoferrin" comprises a 
polypeptide having the amino acid sequence substantially 
as described by Metz-Boutigue, et al. (1984) Eur . J . 
25 Biochem. 1451 : 659-676 and as set forth in Fig. 2. It 
is noted, however, that an earlier partial sequence of 
the human lactoferrin sequence disclosed a number of 
discrepancies between the published sequence and that 
obtained herein. Specifically, the following 

3 0 discrepancies exist (amino acid numbering is from the 
sequence in Figure 1 with DNA position in parenthesis) : 
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Amino Acid 


Position 


In Metz-Boutiaue 


Arg 


122 


(418) 


Absent 


Thr 






lie 


bin 


151 


(505) 


Arg 


Ser 


184 


(604) 


Leu 


Tyr 


189 


(619) 


Lys 


Ser 


372 


(1169) 


TrP 


between Ala 






and Met 


391 


(1122) 


13 amino acids 


Cys 


403 


(1225) 


Gly 


Gin 


512 


(1588) 


Glu 


Lys 


675 


(2077) 


Arg 



Accordingly, human lactoferrin is also defined by the 
sequence shown in Figure 1 which combines the sequence 

15 differences obtained herein with the published sequence. 
The term human lactoferrin also includes allelic 
variations of either of these sequences or recombinant 
human lactoferrin variants wherein one or more amino 
acids have been modified by the substitution, insertion 

20 or deletion of one or more amino acid residues. In some 
instances human lactoferrin may be produced in milk with 
all or part of a secretory signal sequence covalently 
attached thereto. 

As used herein, a "human lactoferrin DNA sequence" is 
25 a DNA sequence which encodes human lactoferrin as 
defined above. Such a human lactoferrin DNA sequence 
may be obtained from a human mammary gland cDNA library 
or may be derived from the human genome. Example 2 
herein describes the cloning and nucleotide sequence of 
30 human lactoferrin derived from a human mammary gland 
cDNA library. The DNA sequence of this human 
lactoferrin is shown in Fig. 1 and Fig. 2 and is 
substantially the same as that described by Rado, et al. 
(1987) Blood 70: 989-993. The construction of plasmids 
35 containing an expressible transgene encoding hLF is 
described in the examples. One of these plasmids is 
cGPlHLF also sometimes referred to as 16,8HLF3) contains 
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a transgene designed for tissue-specific expression in 
bovine mammary secretory cells. 

In a second embodiment of the invention, transgenes are 
provided for producing human serum albumin in the milk 
5 of transgenic bovine species. Human serum albumin is 
a serum protein which contains 584 amino acid residues 
(Minghetti, etal. (1986) J, Biol. Chem. 261 :67471. It 
is the most abundant protein in human serum and performs 
two very important physiological functions. Serum 
10 albumin is responsible for about 80% of the total 
osmolarity of blood and it transports fatty acids 
between adipose tissues. 

Human serum albumin is used primarily to expand plasma 
volume by restoring osmotic pressure in the circulatory 
15 system. Currently, a heat treated serum derived hSA 
fraction is infused in most shock and trauma victims, 
including most of the patients undergoing extensive 
surgery. HSA is presently derived from human blood 
plasma as a by-product from blood fractionation 

2 0 processes to obtain rare blood proteins such as factor 

VIII and IX. The recently developed technology of 
producing such factors by biotechnological means, 
however, threatens the source of human serum albumin. 

As used herein "human serum albumin" comprises a 
25 polypeptide having the amino acid sequence substantially 
as that described by Minghetti, et al., ibid ; Lawn, et 
al. (1981) Nucl. Acids Res. 9; 6103. Also included are 
variations thereof including recombinant human serum 
albumin variants wherein one or more amino acids have 

3 0 been modified by the substitution, insertion or deletion 

of one or more amino acid residues (Minghetti, et al. 
(1986) J. Biol. Chem. 261:6747-6757) . In some 

instances, human serum albumin may be produced in milk 
by expressing a transgene which contains DNA encoding 
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the secretory signal sequence of hSA. Alternatively, 
human serum albumin may be produced in and secreted from 
liver cells of a transgenic animal utilizing a 
completely heterologous transgene comprising human 
5 genomic DNA encoding 5 f expression regulation sequences, 
the human serum albumin secretion signal and structural 
gene and 3 1 expression regulation sequences. As 
indicated in the Examples, transgenes containing this 
heterologous sequence were formed by in vivo homologous 
10 recombination of overlapping transgene fragments to 
reconstitute the hSA gene in the transgenic animal. The 
so formed transgenic animal produced human serum albumin 
in its circulatory system- 

As used herein, a "human serum albumin DNA sequence" is 
15 a DNA sequence which encodes human serum albumin as 
defined above. Such a human serum albumin DNA sequence 
may be obtained from XHAL-HAI , XHAL-3W and XHAL-HI4 as 
described by Urano, et al. (198 6) J. Biol. Chem. 
£61:3244-3251 and Urano, et al. (1984) Gene 32.: 255-261 

2 0 and in the Examples herein. 

The human serum albumin DNA sequence was cloned as 
described in Example 10 herein and subsequently 
manipulated to substitute for the human lactof errin gene 
encoded in plasmid cGPlHLF (also referred to as 
25 pl6,8HLF4). From this plasmid a transgene is obtained 
containing 16 kb of the 5 1 expression regulation 
sequence of the bovine aSl casein gene, human serum 
albumin DNA sequence and approximately 8 kb of the 
3' -flanking region of the aSl casein bovine gene. This 

3 0 transgene is used to microinject fertilized oocytes from 

bovine species. After early detection of transgenesis , 
blastocysts containing the hSA transgene are implanted 
into a recipient female bovine species and brought to 
term. 
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The following is presented by way of example and is not 
to be construed as any limitation on the scope of 
the invention. 



Placental tissue was obtained from the slaughterhouse. 
10 Surrounding connective tissue was removed and pieces of 
about 3 0 grams were quickly frozen in liquid N 2 . 
Chromosomal DNA was isolated as follows: 3 0 grams of 
tissue was homogenized (on ice) with 35 ml of Buffer 1 
containing 3 00 mM Sucrose; 60 mM KC1; 15 mM NaCl; 60 mM 
15 Tris.HCl pH 8,2; 0 . 5 mM spermidine; 0,15 mM spermine; 
2 mM EDTA; 0 . 5 mM EGTA. 65 ml of icecold buffer 1 
containing 1% NP4 0 was added and the mixture was 
incubated for five minutes on ice. After centrif ugation 
for five minutes at 3 000 xg the pellet was rinsed with 
20 buffer 1 containing 1% NP40. After repeating the 
centrif ugation step the pellet was resuspended in 5 ml 
of buffer 1. 5 ml 0.5 M EDTA was quickly added. Final 
volume was now 15 ml. 0.15 ml of a 10% SDS solution was 
added. After mixing, RNAse A and Tl were added to final 
25 concentrations of 0.4 mg/ml and 6 u/ml respectively. 
After incubation at 37 °C for three hours, Proteinase K 
was added to a final concentration of 0.1 mg/ml. This 
mixture was incubated for 15 hours at 37 °C. The mixture 
was then carefully extracted with phenol. The aqueous 
3 0 phase was isolated and 1/3 0 volume of 3M NaOAc pH 5.2 
and one volume of isopropylalcohol was added. The 
precipitate (DNA) was rinsed with 70% ethanol and slowly 
dissolved in 0.5 ml of 10 mM Tris.HCl pH 8.0; 1 mM EDTA, 
at 4°C. 



EXAMPLE 1 



5 



Construction of a Probe Specific 
for Bovine aSl Casein Sequences 



A. 



Isolation of Chromosomal DNA 
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B. Amplification of Sequences from the 

5' -flanking Region of the aSl-casein Gene 

Two DNA-primers were synthesized based on the sequence 
published by Yu-Lee et al. , (1986) Nucl. Acids Res , 14 , 
1883-1902. Primer 1 was located at position-681 
relative to the major transcription initiation site and 
had the following sequence: 

5'-TCC ATG GGG GTC ACA AAG AAC TGG AC- 3 • . 
(Seq. ID No. : 5) 

Primer #2 was located at position +164 relative to the 
major transcription initiation site and had the 
following sequence: 5 1 -TGA AGC TTG CTA ACA GTA TAT CAT 
AGG-3 1 (Seq. ID. No.: 6). The first eight nucleotides 
of this primer are not encoded by the bovine genome, but 
contain a Hindlll restriction site to facilitate 
subsequent cloning steps. These primers were annealed 
to the chromosomal DNA and extended in the presence of 
deoxynucleotides by TAQ-polymerase . After three minutes 
the mixture was denatured for one minute at 92 °C, 
reannealed at 50 °C for 1.5 minutes and again incubated 
at extension temperature (68°C) for 2 minutes. This 
cycle was repeated 3 0 times. After the last cycle DNA 
was checked for the presence of the expected EcoRI 
sites. Both the size of the fragment and the presence 
of EcoRI sites was as expected. The fragment was then 
"treated with Klenow enzyme to repair any overhanging 
ends, treated with kinase to attach phosphate groups at 
the ends of the fragment, incubated at 65 °C for 10 
minutes to inactivate the kinase and klenow enzymes and 
finally digested with Hindlll. This fragment was then 
subcloned in pUC19 (Yanisch-Perron, et al. (1985), Gene , 
33, 103-109) digested with Smal and Hindlll. Formal 
proof of the identity of this fragment was obtained by 
sequencing parts of this subclone (after re-cloning into 
M13 vector) . The determined sequence was identical to 
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the published sequence* This probe was then used to 

screen a bovine genomic library to obtain clones 

specific for the 5' -flanking region of the acSl-casein 
gene. 

5 C. Amplification of Sequences from the 

3' -flanking Region of the ocSl-casein Gene 

A similar approach was taken as described above. Two 
primers were designed based on the sequence published 
by Stewart et al (1984) Nucl. Acids Res . 12 . 3895-3907. 
10 The 5* -primer was located just downstream of the coding 
sequence starting at position 713 of the cDNA sequence. 
It had the following sequence: 

5'-GAG GGA CTC CAC AGT TAT GG-3 1 (Seq. ID No.: 7). 

The other primer was located at position 107 0 of the 
15 cDNA sequence and had the following sequence: 5'-GCA 
CAC AAT TAT TTG ATA TG^3 1 (Seq. ID No.: 8). These 
primers were annealed to the chromosomal. DNA and the 
region between these primers was amplified as described 
above. The resulting fragment was ^9 00 bp longer then 
2 0 expected. Sequence analysis showed that an intervening 
sequence of this size was present between nucleotide 737 
and 738 of the cDNA. The amplified fragment was treated 
with Klenow-polymerase to repair any overhanging ends 
and treated with kinase to attach phosphate groups to 
25 the ends of the fragment. The fragment was then ligated 
into pUC19 previously cut with Smal. 





D. Screening of a Bovine Phage Library 

for aSl-casein Flanking Sequences 

A bovine genomic library, constructed in EMBL3 , was 
obtained from Dr. M. Groenen, Agricultural University 
5 Wageningen, Netherlands, and was screened in the 
following way. The bacteriophage particle titre was 
determined on Escherichia coli MB4 06 a permissive host 
strain (Stratagene Inc.). For this, several dilutions 
of the phage stock were made in SM buffer (50 mM 

10 Tris.HCl pH 7.5, 100 mM NaCl, 10 mM MgS04 , 0.01% 
gelatin) and mixed with 200 ^1 MB406 (O.D. 550 = .0.9); 
after 20 minutes at 37 °C, 3 ml top agarose (Luria- 
Bertani medium, 0.8% agarose, 10 mM MgCl 2 ) was added and 
this was plated on LB plates and incubated overnight at 

15 37°C. 

Approximately 600,000 phages were then plated by adding 
the required amount of phage stock to 400 pi MB406. The 
subsequent plating was as described as above. The next 
step was transfer of the phage to nitrocellulose 

20 filters. Plates were placed at 4°C for one hour. 
Nitrocellulose filters (S&S) were placed on the top 
agarose layer and exact position was marked. After 
lifting, the filters were soaked for (1) 3 0 minutes in 
denaturation buffer (1.5M NaCl, 0.5M NaOH) ; (2) 5 

25 minutes in neutralizing buff er (l.SMNaCl, 0.5M Tris.HCl 
pH 8.0) . After rinsing with 2xSSPE (360 mM NaCl, 20 mM 
NaH 2 P0 4/ 2 mM EDTA) , the filters were baked under vacuum 
at 80 °C for two hours. 

Prehybridization of the filters was performed in a 
30 buffer containing 50% formamide, 5x Denhardt's solution 
(0.1% Ficoll, 0.1% polyvinylpydrolidone, 0.1% bovine 
serum albumin) , SxSSPE, 0.1% SDS and 100 ng/ml denatured 
salmon sperm DNA at 42 °C for two hours. Hybridization 
was performed in same buffer at 42 °C overnight in a 
3 5 shaking waterbath. The probe, generated as previously 





described, was labelled using the Random Primed 
labelling kit: from Boehringer Mannheim. After overnight 
hybridization the filters were washed three times with 
2xSSC, 0.1% SDS at room temperature. 

5 Overnight exposure of Kodak XAR films was performed with 
amplifying screens (Dupont) at -70 °C. Putative 
positives were plugged out of the plates and put 
overnight in SM buffer at 4°C. These were plated out 
as described above and DNA was isolated following the 

10 plate lysate method (Maniatis, T. , et al. (1982), 
Molecu lar Cloning: A Laboratory Manual , Cold Spring 
Harbor, N.Y.). 5 ml SM buffer was added to the top 
agarose layer; after two hours gentle shaking buffer was 
removed and spun at 4 000 rpm at 4°C for 10 minutes. 

15 Supernatant was transferred to sterile tubes and RNase 
A and DNasel (both final concentration l^g/ml) was 
added, this was incubated at 37 °C for 3 0 miniates. One 
volume of a 2 0% polyethyleneglycol, 2.5 M NaCl solution 
was added and put on ice for one hour. Centrif ugation 

20 at 4000 rpm for 30 minutes at 4°C left precipitated 
bacteriophage particles. These were resuspended in 
500 ml SM buffer, SDS (final concentration 0.1%) and 
EDTA (final concentration 5 mM) was added, this was 
incubated at 68 °C for 15 minutes. Protein was removed 

25 with one phenol and one chloroform extraction step. 
Precipitation of phage DNA was performed with one volume 
isopropanol. Phage DNA was washed once with 70% ethanol 
and dissolved in 50 ml Tris.HCl pH 7.5, 1 mM EDTA 
buffer. 

30 Restriction enzyme analysis, agarose gel 
electrophoresis, transfer of DNA from gel to 
nitrocellulose filter and Southern blotting were all 
done according to standard procedures (Maniatis (1982) , 
Molecular Cloni ng: A Laboratory Manual ) . Hybridization 

35 with probes (described hereinafter) was performed 
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according to the same procedure as the screening 
conditions described above. 

E. Isolation of Clones Containing 

5 '-flanking Region of Bovine Sl-casein 

5 Three putative clones were identified using the probe 
and procedures as described above. After another round 
of screening, clean recombinant bacteriophage was 
analyzed. Digestion of cloned DNA with Sail . EcoRI and 
Sall/EcoRI (double digestion) and hybridization with the 

10 probe described above showed identical inserts in all 
three clones. The insert consisted of an 18 kb (partial 
Sau3A fragment excised with Sail ) . Transciptional 
orientation in the clone was determined with 
hybridization of above described restriction fragments 

15 with (1) probe 1 described above, and (2) the Ncol-Nsil 
fragment of probe 1. This showed a region of about 
16 kb upstream of transcription start* Downstream from 
the transcription start was another 1.9 kbp. Sequencing 
of part of the latter region showed the presence of exon 

2 0 2 and part of intron 2 of the bovine aSl-casein gene. 

Additional sequencing of the region-103 - +3 00 confirmed 
the identity of the clone. The ethidium-bromide pattern 
of the described restriction fragments also showed the 
orientation of the clone in the EMBL vector. Subsequent 
25 analysis of the clone with the following restriction 
enzymes (Ncol, Pstl . Kpnl . BamHI, Hindlll , BgTII) 
resulted in the restriction map of 5 1 flanking region 
of bovine Sl-casein gene as shown in Fig. 3. 

F. Isolation of Clones Containing 

3 0 3' -flanking Region of Bovine aSl-casein 

Duplicate nitrocellulose filters from the initial phage 
plating used for isolating 5 1 clones were screened with 
- the 3 1 aSl-casein probe using the same hybridization 
conditions previously described. Eight positive clones 
35 were identified after two rounds of screening. Phage 




• • •• 



-55- 

DNA was prepared as described. Subsequent restriction 
digests with Sail , EcoRI, and Sal/EcoRI and Southern 
hybridization with the 3 1 aSl probe showed identical 
inserts in seven of the eight clones. One clone 
5 containing an 18.5 kb EcoRI insert was further analyzed 
with the restriction enzymes Bstell and BamHI . A 
restriction map of that clone is shown in Fig. 4. 



EXAMPLE 2 
Cloning of Human Lactoferrin Gene 

10 A. Materials 

Restriction endonucleases, T4 ligase, and T7 
polynucleotide kinase were obtained from Boehringer- 
Mannheim, New England Biolabs, or Bethesda Research 
Laboratories. Radio-isotopes were purchased from 

15 Amersham. A human mammary gland cDNA library in 
bacteriophage Xgtll was obtained from Clontech, Inc. , 
Palo Alto, Calif. 

B. Isolation of the Human Lactoferrin Gene 

The human mammary gland library was screened by standard 

20 plaque hybridization technique (Maniatis, et al. (1982) 
Molecular Cloning: A Laboratory Manual ) with three 
synthetic oligomers. Two of the oligomers were 3 0-mers 
corresponding to the cDNA sequence of Rado et al., 
supra . at amino acid positions 436-445 and 682-691. The 

25 third was a 21-mer "best guess" probe based on human 
codon bias and coding for amino acid sequence of HLF 
between amino acid residues 18 and 24. Respectively, 
they were: 

( 1 ) 5 • — CTTGCTGTGGCGGTGGTTAGGAGATCAGAC— 3 1 ( Seq . ID 
30 No.: 9) 

(2) 5 • -CTCCTGGAAGCCTGTGAATTCCTCAGGAAG-3 ' (Seq. ID 
No. : 10) , and 

(3) 5 1 -ACCAAGTGCTTCCAGTGGCAG-3 1 ( Seq. ID No. : 11). 
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Th e probes were radiolabeled (Crouse et al. (1983) 
Methods Enzvmol . 101 . 78-98) and used to screen 
duplicate filters. Filters were washed at a final 
stringency of 2 X SSC f 37°C. 

5 C. Nucleotide Sequence Analysis 

DNA fragments were isolated by use of low-melting 
agarose (Crouse et al, s upra ) and subcloned into 
bacteriophase M13mpl8 or M13mpl9 (Messing et al. (1983) 
Methods Enzvmol , 101 , 2 0-78) . The sequence was 

10 determined using the Sequenase enzyme (modified T7 DNA 
polymerase) (Tabor et al. (1987) Proc. Natl. Acad. Sci. 
USA 84., 4767-4771) . All reactions were carried out 
according to the manufacturer's specifications (US 
Biochemicals) . The sequence is shown in Fig. 1. The 

15 hLF sequence was digested with Hindlll and EcoRI 
(present in the surrounding phage sequences) and 
subcloned into the Hindlll and EcoRI site of pUC19 to 
form pUSH9 Lacto 4.1. This clone contained the entire 
coding sequence of the mature form of hLF , but lacked 

20 the complete signal sequence. 

EXAMPLE 3 

Construction of bovine orSl-casein CAT vectors 

In order to determine whether the aSl casein fragments 
obtained in Example 1 had promoter and other properties 

25 needed to express a heterologous gene, expression 
plasmids were constructed containing variable amounts 
of 5-' and 3' -flanking regions from the aSl-casein gene. 
The chloramphenicol Acetyl transferase gene (CAT) was 
used as a heterologous gene in these vector constructs. 

3 0 The CAT gene is useful to detect the expression level 
for a heterologous gene construct since it is not 
normally present in mammalian cells and confers a 
readily detectable enzymatic activity (see Gorman, C.N. , 
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et al. (1983) , Mol. Cell. Biol, , 2, 1044-1051) which can 
be quantified in the cells or animals containing an 
expressible gene. 



681 bp of a aSl-casein promoter plus the first non- 
coding exon plus approximately 150 bp of the first 
intervening sequence (IVS) were isolated from a .5*- 
flanking genomic clone from Example 1 by PCR 
amplification as an Ncol-Hindlll fragment (approximately 
83 0 bp) . This fragment is identified as fragment 1 in 
Fig. 5A. The primer sequences consisted of: 



that were designed from a sequence published by Yu-Lee 
et al. (1986) Nuc. Acids Res . 14 . 1883-1902. 

Approximately 1.6 kb (fragment 2, Fig. 5A) of aSl-casein 
3 1 -flanking sequence was isolated by PCR amplification 
from a bovine 3' -flanking genomic clone from Example 1. 
This region contained the previously described splice 
within the 3' untranslated region of aSl-casein gene. 
Fragment 2 was subcloned into the Smal site of pUC19 . 
The primer sequences consisted of: 



A. 



DNA sequences 



5 1 -TCCATGGGGGTCACAAAGAACTGGAC-3 1 



(Seq. ID No. : 12) and 

5 • -TGAAGCTTGCTAACAGTATATCATAGG-3 1 

(Seq. ID No. : 13) 



5 1 -GAGGGACTCCACAGTTATGG-3 • 



(Seq. ID No.: 14) and 

5 1 - GCACACAATT ATTTGAT ATG - 3 1 

(Seq. ID No. : 15) 



that were designed from a sequence published by Stewart 
et al. (1984) Nucl. Acids Res . 12, 3895-3907. 
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A hybrid splicing signal comprising the 3 1 splice site 
of an immunoglobulin gene (Bothwell et al. (1981) , Cell, 
24., 625-637) was synthetically prepared and inserted 
into pUC18 along with unique restriction sites flanking 
5 either side to produce pMH-l. This plasmid is shown in 
Fig. 6. Ncol and Hindlll sites were designed such that 
ligation with fragment 1 from the bovine 5 ■ genomic 
clone would result in the functional hybrid splice 
sequence. See Fig. 11. 

10 A polyadenylation sequence was obtained from SV4 0 virus 
as a BamHI-Dral fragment (fragment 3 in Fig. 5A) 
isolated from pRSVcat (Gorman, CM. , et al. (1982), 
Proc. Natl. Acad. Sci. . 79 , 6777-6781) . 

A bacterial CAT coding sequence was subcloned into pUC19 
15 as a Pstl-BamHl fragment. 

B. Construction of pS13'5'CAT 

Fragment 1 of aSl-casein promoter was subcloned into 
pMH-l (Fig. 6) between the Ncol and Hindlll sites to 
form pMHS15' flank. 

2 0 The SV4 0 polyadenylation sequence (fragment 3) was 

subcloned as a BamHI-Dral fragment into pUC19 
immediately 3' to the 3 1 crSl-casein flanking sequence 
(fragment 2) to form pUC19 3' UTR/SV40. This allowed 
for the removal of a continuous EcoRI-Sall fragment 
25 (containing the 3 1 -flanking sequence and poly (A) 
sequence) that was subcloned into pMH-l to derive 
pMHS13'UTR (Fig. 5B) which was used later to construct 
pMHSI 3 1 UTR hlf which contains sequences encoding human 
lactof err in. 

3 0 The EcoRI-Sall sequence (fragments 2 and 3) were 

subcloned into the EcoRI-Sall sites of pMHSIS • flank to 
form pSIS'S'flank. 
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The Pstl-BamHI CAT fragment (fragment 4 in Fig. 5B) , 
after blunting the BamHI site with Klenow, was subcloned 
into pS13 1 5 1 f lank (Fig. 5B) between the PstI and Smal 
sites to form pS^'S'CAT. 

5 C . Construction of pSIS'CAT 

The CAT fragment (fragment 4 in Fig. 5B, Pstl-BamHI) and 
SV4 0 polyadenylation fragment (fragment 3 in Fig. 5A, 
BamHI-Dral) were subcloned into the PstI and Smal sites 
of pMHS15 • flank to form pSIS'CAT (Fig. 5C) . 

10 D. Assay for CAT Production 

Each of these CAT plasmids were transfected into human 
293S cells (Graham, F.L., etal. (1977), J. Gen. Virol. . 
36 , 59-72) by the calcium phosphate co-precipitation 
method (Gorman, CM., et al. (1983), Science, 221 . 551; 

15 Graham, F.L., et al. (1973), Virology , 52, 456-467). 
Cells were harvested 44 hours after transfection and 
cell extracts were assayed for CAT activity (Gorman, 
CM. , et al. (1982), Mol. Cell. Biol. , 2, 1011; 
deCr ombrugghe , B. , etal. (1973), Nature [London], 241, 

20 237-251, as modified by Nordeen, S.K., et al. (1987), 
DNA . jS, 173-178) . A control plasmid expressing CAT 
driven by the Cytomegalovirus Immediate early promoter 
(Boshart, M. , et al. (1985), Cell , 41 , 521) was 
transfected into human 293 S cells to assay for 

25 transfected efficiency. 

pSIS'S'CAT was expressed in these cells at a level which 
was approximately 3 0-100 fold lower than the control 
plasmid, but significantly higher than background. 
Primer extension analysis indicated that transcription 
3 0 had initiated predominantly in the expected region. 

When pSIS'CAT was transfected into 293S cells, 
expression was also detected. 
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EXAMPLE 4 



Bovine aSl-casein/human lactoferrin 
expression cosmid cGPlHLF 



A. 



Construction of DNA Sequences , 



5 16 kb of bovine aSl-casein 5' -flanking sequence from 
Example 1 was isolated from the bovine genomic library 
(phage GP1) as a Sall-Bglll fragment. The Bglll site 
lies at the junction of the first intron and second exon 
of the aSl-casein gene. 

10 Bovine aSl-casein signal sequence (Stewart et al. (1984) 
Nucl. Acids Res . 12, 3 895) was prepared from synthetic 
DNA synthesized on a Cylone Plus® DNA Synthesizer 
(Millgen/Biosearch I) and contained the entire signal 
sequence plus Xhol and Cla I sites attached to the 

15 S'-end, and Nael to the 3 ' -end (fragment 8, Fig. 7B) . 

Cleavage of pUC119 Lacto 4 . 1 with Eael precisely opened 
the plasmid at the codon for the first amino acid of 
mature hLF. Treatment with Klenow was used to fill in 
the overhanging 5 ! -end. Further digestion with AccI and 
20 EcoRI gave two fragments: (a) an Eael-AccI fragment 
containing the first 24 3 bp of mature hLF (fragment 5, 
Fig. 7C) , and (b) a contiguous AccI-EcoRI fragment 
(fragment 6, Fig. 7C) of 1815 bp that contained all but 
five terminal codons of the remaining coding sequence. 

25 A synthetic linker was prepared that contained the last 
five codons of hLF beginning at the EcoRI site and 
extending for four bases beyond the stop codon. A Kpnl 
site was added to the 3 ' -end (fragment 7 in Fig. 7C) . 

An 8.5 kb EcoRI 3' -fragment was isolated from the bovine 
3 0 genomic library (Fig. 4) containing sequences beginning 
just downstream of the coding region of aSl-casein and 
a BstEII site approximately 350 bp from the 5»-end. 
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This fragment was subcloned into pMH-1 at the EcoRI site 
to form pMH3'E10 (Fig. 7A) . A Sail site is adjacent to 
the 3 '-EcoRI site in pMH3»E10. 

B. Construction of cGPlHLF 

5 The hLF 3 1 -linker (fragment 7, Fig. 7C) was subcloned 
into the EcoRI -Kpnl sites of pMH3'UTR (Fig. 7A) to 
produce pMH 3 'UTRhLF2 linker (Fig. 7 A) . 

The synthetic bovine aSl-casein signal sequence 
(fragment 8) was then subcloned into the Xhol and Smal 
10 sites of pMH3 'UTRhLF2 linker to make pS13'hLFl/2L (Fig. 
7B) . 

The two hLF coding fragments (fragments 5 and 6 in Fig. 
7C) were subcloned into the Nael and EcoRI sites of 
pS13 •hLFl/2L (Fig. 7B) to make pSIS'UTRhLF (Fig. 7C) . 

15 The large aSl-casein 3 1 UTR fragment from pMH3 f E10 (Fig. 
7A) was isolated as a BstEII-Sall fragment and subcloned 
into the same sites of pS13 "UTRhLF to form phLF3 ■ 10 kb 
(Fig. 7D) . 

Cosmid cGPlHLF was prepared from a 3 -way ligation (Fig. 
20 7F) : 

(1) the 16 kb 5 1 -flanking sequence from phage 
GP1 (Example 1, Fig. 3) was modified by attaching two 
linker adapters. The Sail site at the 5 ' -end was 
ligated to a Notl-Sall linker. The Bglll site at the 

25 3 f -end was ligated to a Bglll-Xhol linker; 

(2) the hLF coding region, flanked on the 5 1 -end 
by the aSl-casein signal sequence and on the 3 • -end 
by approximately 8.5 kb of aSl-casein 3 1 -flanking 
sequence, was isolated as a Xhol-Sall fragment from 

3 0 phLF3'10 kb. The Sail site at the 5 • -end was ligated 
to a Sall-NotI linker; 




-62- 

(c) Cosmid pWE15 (Stratagene, Inc.) was 
linearized with Notl. 

Fragments from (a) , (b) , and (c) were ligated together 
and transfected into bacteria using commercial lambda 
5 packaging extracts (Stratagene, Inc.) to produce 
cGPlHLF. 

EXAMPLE 5 

Bovine otSl-casein/hLF expression plasmids . 

A. Construction of pS13 1 5 1 hLF 

10 The Hindlll-Sall fragment of pS13'UTRhLF was subcloned 
into the same, sites in pMHSlS' flank to form pSlS^'hLF 
(Fig. 7E) . This plasmid contains 681 bp of bovine 
aSl-casein promoter sequence, the aSl-casein/IgG hybrid 
intron, the aSl-casein signal sequence, the hLF coding 

15 region, approximately 1.6 kb of aSl-casein 3 # -flanking 
sequence, and the SV4 0 late region polyadenylation 
sequence. 

B. pS15 1 hLF 

Plasmid pS13 , 5 , hLF (Fig. 7E) was cut with Kpnl and BamHI 
2 0 which border the aSl-casein 1.6 kb 3 '-flanking sequence. 
The larger vector fragment was purified, made blunt 
ended with Klenow, and self -ligated to form pSIS'hLF. 

C. Radioimmunoassay for hLF 

An immunoglobulin-enriched fraction of ascites fluid of 
25 a monoclonal antibody against human lactoferrin, which 
does not cross-react with the bovine or murine protein, 
was prepared by 50% ammonium sulfate precipitation and 
coupled to CNBr-activated Sepharose 4B ( 20 mg of 
protein to 1 g of Sepharose) . The Sepharose beads were 
30 suspended (2 mg/ml) in phosphate-buffered saline (PBS; 
10 mM sodium phosphate, 0.14 M NaCl containing 10 mM 
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EDTA, 0.1% ( w /v) Polylorene and 0. 02% ( w /v) NaN 3/ pH 7 . 4 . 
Sepharose suspensions (0.3 ml) were incubated for five 
hours at room temperature by head-over-head rotation 
with samples (usually 50jxl) in 2-ml polystyrene tubes. 
5 Sepharose beads were then washed with saline (five times 
with 1.5 ml) and incubated for 16 hours at room 
temperature with 50^1 (lkBq) of 125 I-labeled-af f inity- 
purified polyclonal rabbit anti human lactoferrin 
antibodies, together with 0.5 ml of PBS, 0.1% ( w /v) 

10 Tween-2 0. Thereafter the Sepharose was washed again 
with saline (four times with 1.5 ml) and bound radio 
activity was measured. Results were expressed as 
percent binding of the labelled antibodies added. 
Levels of lactoferrin in test samples were expressed in 

15 nanomolar, using purified human milk lactoferrin as a 
standard (serial dilutions in PBS, 10 mM EDTA, 0.1% ( w /v) 
Tween-2 0 . 

Repeated testing of standard on separate occasions 
revealed that this RiA was highly reproducible, intra- 
2 0 and inter assay coefficients of variation ranged from 
5-10%. As little as 0.1 nanogram human lactoferrin is 
easily detected by this RIA. 

D. Expression in 293S cells 

29 3 S cells were transfected with the above hLF plasmids 
25 as described (ljxg of a CMV-CAT plasmid was co- 
transfected as control for transf ection efficiency) . 
Forty-four hours after transfection medium was removed 
from the cells and assayed for hLF as described supra , 
RNA was isolated as described by Stryker, et al. (1989) 
30 EMBO J. 8, 2669. The results can be summarized as 
follows: 

1. Transfection efficiencies are identical for the 
two hLF plasmids; 
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2 . hLF is expressed in the cells and secreted into 
the medium. In both cases, the levels are about 0.4/xg/ml 
medium using about 3x 10 6 cells 

3 . The proteins behave identical to hLF in a human 

milk sample in a dose response assay measuring the amount 
125 

of I- anti-lactof errin bound as a function of the 
amount of sample used. 

4. The protein has about the same size (*8 0kD) as in 
a human milk sample as judged by Western blotting. 

5. The hLF RNA produced in the cells has the correct 
size and its level is similar for both plasmids as 
judged by Northern - blotting. 

These data indicate that these two expression plasmids 
are able to express hLF. By all standards used so far, 
the protein is identical to hLF present in human milk. 
The heterologous signal sequence is functional in that 
it promotes secretion of the protein from the cells into 
the medium. Further, the casein regulatory sequences 
used in these plasmids are able to promote expression 
of a heterologous gene. 



Immature oocytes are obtained in large quantity 
(400-600/day) by aspirating follicles of ovaries 
obtained at abbatoirs. Immature oocytes are cultured 
for a period in vitro before they are competent to be 
fertilized. Once "matured", oocytes are fertilized with 
sperm which has also been matured, or "capacitated" in 
vitro. The pronuclei of the fertilized oocyte (or 
zygote) is then injected with the transgene encoding for 
the expression and secretion of human lactoferrin. 
Preferably the zygotes are substantially synchronous 



EXAMPLE 6 



In vitro Maturation, Fertilization 
and Culture of Bovine Oocytes - 
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such that greater than about 30, 50, 70, 90 or 95% of 
zygotes are in S-phase at the time of injection. 
Zygotes resulting from this in vitro fertilization and 
microinjection are then cultured to the late morula or 
5 blastocyst stage (5-6 days) in medium prepared, or 
"conditioned" by oviductal tissue. Blastocysts are then 
transferred non-surgically to recipient cattle for the 
balance of gestation or analyzed for integration of the 
transgene as described herein. 

10 In vitro maturation fIVM) . Ovaries are obtained 
immediately after slaughter at local abbatoirs and 
oocytes are recovered. Alternatively, oocytes are 
obtained from living cattle by surgical, endoscopic, or 
transvaginal ultrasonic approaches. In all cases, 

15 oocytes are aspirated from ovarian follicles (2-10 mm 
diameter) . After washing, oocytes are placed in a 
maturation medium consisting of M199 supplemented with 
10% fetal calf serum, and incubated for 2 4 hours at 
39°C. Sirard et al. (1988) Biol. Reprod . 39 , 546-552. 

20 In vitro fertilization (ivn . Matured oocytes are 
fertilized with either fresh or thawed sperm. Sperm are 
prepared for fertilization by first obtaining a 
population of sperm enriched for motility by a "swim-up" 
separation technique (Parrish et al. (1986) 

25 Therioaenolocry 25, 591-600) . Motil sperm are then added 
to a fertilization media, consisting of a modified 
Tyrode^s solution (Parrish et al. (1986) supra . ) 
supplemented with heparin to induce sperm capacitation 
(Parrish et al. (1988) Biol. Reprod . 38., 1171-1180) . 

3 0 Capacitation constitutes the final sperm maturation 
process which is essential for fertilization. Sperm and 
oocytes are co-cultured for 18 hours. A useful feature 
of this IVF method is that (in the case of frozen sperm) 
consistent, repeatable results are obtained once optimal 
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f ertilization conditions for a particular ejaculate have 
been defined (Parrish et al. (1986) supra . ) . 

In vitro culture (1VC) . Conventional culture systems, 
which support development of murine, rabbit, or human 
5 ova, do not support development of bovine embryos past 
the 8-16 cell stage. This problem has been overcome by 
pre-conditioning culture media with oviductal tissue. 
Oviduct-conditioned medium will support bovine embryos 
past the 8-16 cell stage to the blastocyst stage in 
10 vitro (Eyestone and First (1989) J. Reprod. Fert . 85, 
715-720) . 

Bovine embryos have proved refractory to in vitro 
culture. This in part stems from the existence of a 
"block" to cleavage in vitro at the 8-16 cell stage. 
15 This block may be alleviated by culturing embryos in the 
oviducts of rabbits (reviewed by Boland (1984) 
Therioa enoloav 21, 126-137) or sheep (Willadeen (1982) 
in: Mammalian Egg Transfer, (E. Adams, ed. , 

pp. 185-210) ) ; Eyestone et al. (1987) Therioaenoloav 2 8 . 

2 0 1-7) . However, these in vivo alternatives have been 

less than ideal, in that: (1) they require the 
maintenance of large numbers of recipient animals, 
(2) they require surgery to gain access to the oviducts 
for transfer, and a second surgery (or sacrifice) to 
25 recover the embryos, (3) all transferred embryos are 
seldom recovered, and (4) access to embryos during 
culture for observation or treatment is entirely 
precluded. The lack of in vitro culture systems has 
hampered the development of various manipulation 

3 0 techniques (such as gene transfer by pronuclear 

injection) by . preventing accumulation of basic 
information of the chronology and ontogeny of bovine 
development, and by complicating the process of 
culturing embryos to a stage compatible with non- 




-67- 

surgical embryo transfer and cryopreservation techniques 
(e-a-/ late blastocyst stages). 

Bovine embryos did not yield to attempts to culture them 
in vitro past the 8-16 cell "block" until Camous et al. 
(1984) J. Reprod. Fert - 72 , 479-485 demonstrated 
cleavage to 216 cells when embryos were co-cultured with 
trophoblastic tissue. 

The co-culture procedure was extended to oviductal 
tissue, based on the ability of homo- or hetero-oviducts 
to support development from zygote to blastocyst. Thus, 
bovine embryos co-cultured with oviductal tissue, or in 
medium conditioned by oviductal tissue, developed from 
zygote to blastocyst in vitro (Eyestone and First, 

( 1989 ) i= Reprod . Fert . 85 , 715-720; Eyestone W.H. 

(1989) "Factors affecting the development of early 
bovine embryos in vivo and in vitro . " Ph.D. Thesis, 
University of Wisconsin) . Blastocysts have been 
produced in this system after superovulation and 
artificial insemination, or by in vitro maturation 
(IVM) , and fertilization (IVF) of immature oocytes. 
Blastocysts produced in this fashion resulted in 
pregnancies and live calves after transfer to recipient 
animals. The results obtained were as follows: 



Number 
(per 100) 

90 

72 

22 

11 



Step 
IVM 
IVF 
IVC 

Embryo transfer 
(% pregnant) 



Efficiency 
Li] 

90 
80 
30 
50 
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Therefore, from an initial daily harvest of 500 oocytes, 
it is expected the approximately 55 pregnancies will 
result. 

Preparation of Oviduct Tissue 
5 Co-Culture and Conditioned Medium 

1. Obtain bovine oviducts after slaughter or by 
salpingectomy. 

2. Harvest lumenal tissue by scraping intact 
oviduct gently with a glass slide. 

10 3. Wash tissue 5 times in 10 ml modified tyrodes- 

hepes solution (Parrish et al. (1988) Biol. Reprod . 38 , 
1171-1180) . 

4. Resuspend final tissue pellet in M199 + 10% 
fetal calf serum at a ratio of 1 volume tissue: 50 

15 volumes of media. 

5. Tissue suspension can be used for embryo-co- 
culture. 

6. Alternatively, media may be conditioned for 
48h; after centrifuging the suspension, the supernatant 

2 0 may be used as embryo culture medium. Conditioned 

medium may be stored at -70 °C, if desired. Conditioned 
medium should be used at full strength for embryo 
culture (no dilution) (Eyestone (1989) ibid ) . 

EXAMPLE 7 

25 Microinjection of hLF 

Transaene into Bovine Pronuclei 

The DNA fragment containing the hLF expression unit is 
excised from the vector by digestion with the 
appropriate restriction enzyme (s) and separated on 

3 0 agarose gels. The fragment is purified by 

electroelution, phenol and chloroform extraction and 
ethanol precipitation (Maniatis et al.). The DNA 
fragment is dissolved in and dialyzed in 10 mM tris, 





-69- 

0.1 mM EDTA pH 7 . 2 at a concentration of 1 to 2/xg/ml. 
Microinjection needles are filled with the dialyzed DNA 
solution. 

Before in vitro fertilization, cumulus cells are removed 
5 from the egg by either vortexing at maximal speed for, 
2 minutes or pipetting the eggs up and down several 
times in a standard micropipet. Bovine pronuclei are 
injected in principle as murine pronuclei (Hogan, B. et 
al. (1986) in: Manipulating the mouse embryo, Cold 

10 Spring Harbor Laboratory) with an additional 
centrif ugation step in order to visualize the pronuclei. 
The injection takes place 18-2 4 hours after 
f ertilization. The time varies depending on the bull 
used as a source of semen. Different batches of semen 

15 cause the nuclei to become visible at different times. 

Bovine oocytes, matured and fertilized in vitro , are 
spun in an eppendorf tube in 1 ml of tyrodes-hepes 
solution (Parrish (1987)) at 14500 g for eight minutes 
(Wall et al. (1985) Biol. Renrod . 32., 645-651) . The 

2 0 embryos are transferred to a drop of tyrodes-hepes 

solution on a microscope slide covered with paraffin 
oil. Using a hydraulic system the oocytes are fixed to 
the egg holder in such a way that both the pronuclei are 
visible (using interference-contrast or phase contrast 
25 optics) . If necessary, the oocytes are rolled to change 
their position on the egg holder to visualize the 
pronuclei. The injection needle is brought into the 
same sharp focus of one of the pronuclei. The needle 
is then advanced through the zona pellucida, cytoplasm 

3 0 into the pronucleus. A small volume of 1-3 pi is 

injected (containing 20-100 DNA copies) into the 
pronucleus either by using a constant flow or a pulse 
flow (using a switch) of DNA solution out of the needle. 
Alternatively, two cell stage embryos are spun as 
3 5 described and the nuclei of both blastomers are injected 
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as described. The injected embryos are then transferred 
to a drop of co-culture medium as described in Example 6 
in order to develop to the morula or blastocyst stage. 



Upon the microinjection of a construct, the oocyte is 
cultured, A proper site of each embryo is cleaved and 
subjected to lysis (King, D. et al. (1988) Molecular 
Reprodu ction and Development 1, 57-62), proteolysis 
(Higuchi, R. , (1989) "Amplifications (A forum for PCR 
Users." 2, 1-3) and DPNI digestion. PCR is performed 
as described previously (Ninomiy, T. et al. (1979) 
Molecul ar Reprod. and Devel . 1, 242-248) with sets of 
two primers, one in aSl and the other in hLF cDNA 
sequence. For example, in a PCR where the forward 
primer (3 0mer) aSl sequence is 

ATG AAA CTT ATC CTC ACC TGT CTT GTG 

(Seq. ID No. : 16) 
and the reverse primer (3 0mer) in hLF sequence is GGG TTT 
TCG AGG GTG CCC CCG AGG ATG GAT (Seq. ID No. : 17) ; 971- 
1000 of Figure 1), a 990 bp fragment will be generated. 
This fragment contains the hitherto inactivated DpNI 
site by loss of adenosine-methylation, at 934 bp away 
from the start of the forward primer. 



EXAMPLE 9 

Production of hLF in Milk of Bovine Species 

Bovine morula developed from microinjected oocytes are 
split according to the method of Donahue (Donahue, S. 
( 1986 ) Genetic Engineering of Animals , ed. J. Warren 
Evans et al., Plenum). One half of the morula is kept 
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in culture to develop into blastocysts. The other half 
is subjected to the DNA analysis as described in 
Example 8. When the result of this analysis is known, 
the morula kept in culture are developed into a 
5 blastocyst or as a source for nuclear transfer into 
enucleated zygotes* Blastocyst transfer into 

synchronized cows is performed according to the method 
of Betteridge (Betteridge, K.J. (1977) in: Embryo 
transfer in farm animals: a review of techniques and 
10 applications) . 

hLF is detected in the milk of lactating transgenic 
offspring using the RIA of Example 5. 



15 Three overlapping phage clones that contain the complete 
hSA gene are used to construct an expression vector for 
hSA. They are designated \HAL-HA1, XHAL-3W and \HAL- 
H14. They are described in Urano, et al. (1986), J. 
Biol. Chem . . 261 , 3244-3251; and Urano, et al. (1984), 

20 Gene, 3_2, 255-261. The sequence of the gene plus some 
surrounding regions is published in Minghetti, et al. 
(1986), J. Biol. Chem . . 261 , 6747-6757. A single phage 
containing the complete hSA gene is constructed as 
follows : 

25 Clone HA-1 is cut with BstEII and Ahall. The «1400 bp 
fragment running from position 1784 (in the first exon, 
just downstream of the ATG) to 3181 is isolated and a 
synthetic linker is attached to the BstEII site at the 
5 1 end containing the first few amino acids that are cut 

3 0 off with BstEII as well as the sequence surrounding the 
ATG as well as a few convenient restriction sites. This 
fragment is called fragment #1. 
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Clone 3W is cut with Ahall and SacI the «13.1 kb 
fragment running from position 3181 to 16322 is isolated 
and a synthetic linker is attached to the SacI site to 
facilitate cloning in phage EMBL3 * This fragment is 
5 called fragment #2. 

These two fragments are ligated and cloned in phage 
EMBL3. After identification of the correct phage, a 
fragment running from just upstream of the BstEII site 
(where unique restriction sites have been introduced) 
10 to the SacI site are isolated and ligated from a SacI 
to Sail fragment (running from position 163 22 to «212 00 
isolated from clone H-14. These two fragments are then 
ligated and cloned in EMBL4 . 

After cutting with Clal (just upstream of the BstEII 
15 site, newly introduced) and BamHI (just downstream of 
the Sail site in the phage DNA) this new clone yields 
a fragment containing the complete hSA gene with about 
2.5 kb 3 1 -flanking sequence* 

To construct an expression vector for hSA cosmid cGPlHLF 

2 0 is partially digested with Clal and BamHI . This removes 

the signal sequence, the coding sequence of hLF, the 3 f - 
UTR and poly (A) addition region of aSl-casein as well 
as a small region 3 1 of the casein gene. 

This is ligated to the hSA fragment described above and 
25 the resulting cosmid is called cGPlHSA. 

The expression vector so formed contains, (1) 16 kb of 
promoter sequences derived from the aSl-casein gene, 
(2) the first exon and intervening sequence of this gene 
both present in GP1, (3) the signal sequence of the hSA 

3 0 gene the complete genomic gene coding for hSA including 

2.5 kb downstream of that gene, and (4) «8 kb of 
3 1 ~ f l anJ cirig sequence derived from the aSl-casein gene. 





-73- 



10 



15 



20 



25 



This transgene is used to produce transgenic bovine 
species producing hSA in their milk in a manner 
analogous to that used to produce hLF in the milk of 
bovine species. 



Purification of heterologous proteins from milk is 
facilitated by the fact that, following casein 
precipitation, those proteins, for the most part, are 
found in the whey fraction which is less contaminated 
than the production media used in microbial or cell- 
based systems. 

Chromatographic techniques are preferred for the 
purification of hSA from cow milk. This approach 
produces a better recovery and higher albumin purity as 
well as a lower content of albumin polymers as compared 
with ethanol fractionation (Curling (1980) in: "Methods 
of Plasma Protein Fractionation" , Curling, ed. , Academic 
Press London, UK; Curling et al. (1982) J. Parenteral 
Sci. Technol . 36 , 59; Berglof et al. and Martinache et 
al. (1982) Joint Meeting IHS-ISBT, Budapest) . The 
specific transport role of hSA as well as its major role 
in maintaining intravascular osmotic pressure may also 
be better preserved upon chromatographic purification 
(Steinbruch (1982), Joint Meeting ISH-ISBT, Budapest). 

The following steps are used to recover hSA produced in 
the milk of transgenic cows: 

1. Precipitation of caseins (about 80% of milk 
protein) and essentially all the milk fat at pH 4.5 



EXAMPLE 11 



Purification of HSA from 
the Milk of Bovine Species 
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and/or by adding chymosin. The whey fraction 
contains the albumin; 

2. Af f inity-chromatography of albumin on Cibacron blue 
3GA-Sepharose CL-6B (Harvey (198 0) in: Methods 

5 of Plasma Protein Fractionation, op. cit.) This 

step serves both to remove proteins other than 
albumin and to decrease the volume to be handled 
about 3 0-fold. Albumin is eluted from this matrix 
with 0.15 M NaCl and 20 mM sodium salicylate at 
10 pH 7.5; 

3. Buffer-exchange on Sephadex G-2 5: desalting into 
0.025 M sodium acetate, adjustment to pH 5.2, 
followed by filtration; 

4 . Anion-exchange chromatography on DEAE-Sepharose 
15 CL-6B . Desorption of albumin at pH 4.5; 

5. Cation-exchange chromatography on CM-Sepharose CL- 
6B. Albumin elution with 0.11 M sodium acetate, 
pH 5.5 and concentration of albumin at a 6% (w/v) 
solution by ultrafiltration; and 

20 6. Gel filtration on Sephacryl S-200. Fraction of 
high-molecular weight protein (e.g. albumin 
polymers, pyrogens) is discarded. The main 
fraction (albumin monomers) is concentrated by 
ultrafiltration and formulated. 
25 It is to be noted that steps 3-6 are essentially 
identical to the method described by Curling and others 
(Curling (1980) op. cit. ; Curling et al. (1982) op. 
cit.; Berglof et al. (1982) op. cit.) for the 
purification of hSA from plasma. 
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EXAMPLE 12 

Transgenic Mice Containing the Human 

Serum Albumin (hSA) Transgene 
Generated by Homologous Recombination 

5 Three overlapping genomic hSA clones were used to 
generate the hSA gene in transgenic mice, XHAL-HA1, 
XHAL-H14 and XHAL-3W, are shown in Figure 8 as reported 
by Urano, et al. (1984) , Gene , 32 . 255-2 61 and Urano, 
et al. (1986), J, Biol. Chem. , 261 3244-3251. Briefly, 

10 a genomic library was constructed from a partial EcoRI 
digest of human fibroblast DNA. For the clones XHAL-H14 
and XHAL-3W, this library was screened with 32 P-labeled 
human albumin genomic clones by hybridization in 1 M 
NaCl, 50 mM Tris-HCl (pH 8.0) , 10 mM EDTA, 0.1% SDS, 100 

15 ug/ml of sheared salmon sperm DNA and lOx Denhardt's 
solution at 65° C overnight after prehybridization in 
3x SSC and lOx Denhardt's solution. Following 
hybridization, filters were washed in 0.2x SSC and 0.1% 
SDS at 65° C. The isolation of the XHAL-HA1 clone was 

20 identical except that a 0.9 kb Bglll-EcoRI fragment from 
the 5 1 end of XHAL-3W was used to screen the human 
fibroblast library. 

These three hSA phage clones were used to generate three 
overlapping linear DNA fragments, which in composite 

25 comprised the whole HSA gene and flanking regions. The 
5 1 most fragment I was a EcoRI-EcoRI fragment isolated 
from XHAL-HA1; the middle fragment II was a Acyl 
(-Ahall) -Sad fragment of XHAL-3W; and the 3 f most 
fragment III was a Xhol-Sall fragment of XHAL-H14 

3 0 (Fig. 7) . The fragments were treated with klenow DNA 
polymerase and dNTP • s to fill, in overhanging sticky 
ends. In some experiments, the blunt ended fragments 
were then treated with bacterial alkaline phosphatase 
to remove the 5 1 phosphate groups from each fragment. 

3 5 The overlapping DNA fragments were next concentrated 
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then coinjected into the male pronuclei of fertilized 
mouse eggs according to published methods (Hogan, et al. 
(198 6) in "Manipulating the Mouse Embryo: A Laboratory 
Manual'', Cold Spring Harbor Laboratory). While the 
5 number of molecules injected varied from «25 to «100 per 
egg cell, the ratio of the individual fragments was 
approximately 1:1:1. Embryos were implanted into the 
uteri of pseudo pregnant female mice according to the 
methods of Hogan, et al., su pra . 

10 To assay correct homologous recombination of the three 
overlapping fragments and integration of the nascent 
transgene into the mouse genome, genomic DNA from the 
newborn pups was subject to the following specific 
digestions followed by Southern hybridization with HSA 

15 cDNA probes: 

Bst EII: cuts outside the HSA gene region and yields an 
18 kb band if correct recombination occurred; 

Nco I: cuts outside the overlapping regions and yields 
bands of 8.0 and 9*3 kb if correct recombination 
20 occurred; 

Nco I + Hind III: cuts at several positions outside the 
region of overlap, indicative of the presence of intact 
f ragments ; 

Hinc II: cuts in the overlapping regions, yielding 
25 several bands indicative of correct arrangement in these 
regions . 

In an initial experiment of 28 transgenic animals born, 
22 had correctly recombined all three fragments. From 
20 out of those 22 animals blood was collected and 
3 0 assayed for the presence of hSA protein using a radio 
immuno assay. 15 out of those 2 0 animals showed hSA 
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expression at levels between 0.5 and 5 ^ug/mL. None of 
the animals that had no recombination or that were not 
transgenic showed any expression. Using RNA blots, only- 
two (the two with the highest protein level) showed a 
5 band. We are currently performing blots on RNA that has 
been enriched for the presence of mRNA (i.e., poly (A) + 
RNA) . Using reverse transcriptase to synthesize cDNA, 
followed by PGR, we have observed a perfect relationship 
between the presence of RNA and protein. However/ in 
10 this experiment we could not determine the size(s) of 
the RNA. 



15 This example describes the construction of two hLF 
transgenes wherein the first contains approximately 
16 kb of aSl casein 5 f expression regulation sequence 
(pGPlhLF (16 kb) also referred to as pl6,8HLF4) and the 



20 expression regulation sequence (pGPlhLF (8 kb) also 
referred to as p8.8HLF4). The overall strategy for 
these constructions is depicted in Fig. 9. 

A 1.8 kb EcoRI-Bglll fragment (fragment C in Fig. 9) was 
isolated from phage clone GP1. This fragment runs from 

25 position -100 of the transcription start site into the 
second exon of the aSl casein gene. The Bglll site lies 
at the junction of the first entron and second exon of 
the aSl casein gene. The 3" end containing the Bglll 
site was ligated to a synthetic Bglll-Clal linker and 

3 0 subcloned into the plasmid pUC19 . The resulting plasmid 
is designated pEBS. 
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Fragment B in Fig. 9 was isolated as an EcoRI fragment 
and cloned into the EcoRI site of pEBS. Fragment B 
includes sequences from position -7500 to position -100 
of the transcription start site in the aSl casein gene, 
5 The plasmid so formed is designated pEB3S and contains 
the combination of fragments B and C is the 8.9 kb 
EcoRI-Clal fragment running from position -7500 to 
position +1400 of the transcription start site. The 
8.9 kb EcoRI-Clal fragment from pEB3 , obtained by 
10 complete digestion with Clal and partial digestion with 
EcoRI was isolated and subcloned into EcoRI-Clal cut 
pKUN2 (a derivative of pKUN; Gene (1986) 46, 269-276 
containing a NotI restriction site) to form pNE3BS. 

An 8.5 kb Clal-EcoRI fragment (fragment A in Fig. 9) 
15 running from position -16000 to position -7500 of the 
transcription start site was isolated from phage GP1. 
It was thereafter subcloned into pUC19 to form pSE. 
Using synthetic oligonucleotide, a unique NotI site was 
introduced into the Clal site thereby destroying it. 

2 0 The resulting plasmid is designated pNE. 

The insert from pNE was isolated as a Notl-EcoRI 
fragment and together with the EcoRI-Clal insert from 
pNE3BS was ligated into the cloning vector pKUN2 . The 
resulting plasmid pGPl (A2ex) contains 16 kb of aSl 
25 casein promoter plus the 5 1 end of the gene to the Bglll 
site at the border of the second exon. 

The final plasmid (16,8HLF4) containing the transgene 
was assembled using the Notl-Clal fragment from clone 
pGPI (A2ex) and the Xho-NotI fragment from clone pHLF 

3 0 3 1 10 kb. The structure of this transgene is the same 

as previously described herein. 

As a minor modification to this plasmid the Sail site 
of this plasmid was removed by cutting with Sail and 
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inserting a linker that contains a NotI site, but not 
a Sail site. Subsequently, a Sail site was introduced 
just downstream of the hLF sequence by cutting the Kpnl 
site as that position adding the following linker: 

5 5 ' -CGTCGACAGTAC-3 1 (Seq. ID No.: 18) 

CATGGCAGCTGT-5 1 (sEQ. id nO . : 19) 
In effect, the hLF sequence is now surrounded by two 
unique restriction sites (Clal and Sail) and can be 
replaced by any recombinant ANA sequence that has a 
10 Clal-site at the 5 1 - end and a Sail-site at the 3 1 - end. 

Another transgene was constructed that is identical to 
the foregoing except that it contains only about 8 kb 
of 5' oeSl casein expression regulation sequence. It was 
constructed by taking the Notl-Clal fragment from pNE3BS 
15 and fusing it directly into Xho-otI fragment from clone 
pHLF 3' 10 kb. The resulting plasmid was designated 
pGPIhLF (7 kb) (also referred to as p8.8HLF4). 

Plasmid 16,8hLF4 was modified to contain a hybrid splice 
signal (aSl casein-IgG) described in examples 3 and 5. 
20 The resulting plasmid was designated 16,8hLF3 and is 
identical to 16,8hLF4 except for the presence of a 
hybrid intron versus a "natural" casein intron in the 
5 ■ -UTR. 

The hLF signal sequence can also be used in all of the 
25 cDNA constructs disclosed herein instead of the casein 
signal sequence. This can be done in the following way: 
A synthetic oligo was made that contains the complete 
hLF signal sequence (see Fig. 2) plus a Clal restriction 
site at the 5'-end and an EagI restriction site at the 
3 0 3 '-end. These restriction sites also border the casein- 
signal sequence in the other plasmids (e.g. , pl6,8hLF4) . 
A fragment containing the hLF-cDNA surrounded by Clal 
and Sail sites was cloned in pGEM7 (Stratagene, Inc.) 
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containing a Clal and Sail site. The resulting plasmid 
was digested with Clal and EagI and used as a vector to 
accommodate the Clal-EagI fragment containing the hLF 
sequence. From the positive clones, the cDNA, with its 
5 own sequence f was excised as a Clal-Sall fragment and 
inserted in Clal-Sall digested pl6 / 8hLF4 to generate 
pl6,8hLF5. Similarly, this Cla-Sal fragment containing 
the hLF-cDNA plus hLF signal sequence can be inserted 
in any hLF cDNA vector. 

10 EXAMPLE 14 

Production of Recombinant Human Lactoferrin and 
Human Serum Albumin in the Milk of Transgenic Mice 

Transgenic mice were generated utilizing several of the 
transgenes identified in the examples herein. The 
15 transgenes used are identified in Tables 3 and 4. In 
each case, the 5 1 and 3' expression regulation sequences 
were from the bovine aSl casein gene, the RNA splice 
signal in the 5' untranslated region was either 
homologous from the aSl casein gene or a hybrid casein- 

2 0 IgG intervening sequence. The recombinant DNA in each 

case was derived from cDNA clones. 

The transgene containing 2 6 kb of 5' aSl casein 
expression regulation sequence was generated by in vivo 
homologous recombination of overlapping fragments. 
25 Briefly, a phage clone containing an approximately 14 kb 
Sail insert was identified. This insert contains about 
11.5 kb of sequence upstream from the 5 1 casein sequence 
contained in 16,8hLF4 and about 2.5 kb of overlapping 
sequence. The NotI insert from 16,8hLF4 and the Sail 

3 0 phage insert were coinjected to produce the 2 6.8hLF4 

mice. 




T 




Plasmid 
from 
which 
transgene 
excised 

pO . 7 , 8 
hLF4 

p8,8 
hLF4 

*pl6,8 
hLF4 

p26,8 
hLF4 

pl6,8 
hLF3 



Length 

of 5 1 - Length 

expression of 3*- 

regulator expression 



segment 
0.68 



6.2 



14.5 



26 



14.5 



regulator 
segment 



TVS 



Maximum Range 
expression of mean 
Number levels levels 
of recorded recorded 
(Mq/Ml) (jug/ml) 



strains _ 

8 homologous 6 0.0-0.8 0.0-0.1 

8 homologous 6 5-36 2.5-16 

8 homologous 5 0.3-3.6 0.0-1.8 

8 homologous 5 0.6-10 0.2-1.7 

8 heterologous 13 0.0-708 0.0-200 



*The number in the plasmid designations before the comma represents the 
approximate length in kbp in the 5' sequence from the bovine aSl casein 
promoter/ flanking region while the number after the comma represents 
the approximate length in kbp in the 3 ' flanking sequence of the aSl 
gene. Note the actual number of bases for the 8 kb and 16 kb promoter 
(5* flanking region) is 6.2 and 14.5 kbp, respectively. 

^Exception: An additional pl6,8hLF4 transgenic mouse (line 145) not 
included in the data in Table 3 gave a maximum expression level of 224 
/ig/ml and a mean of 112 ^g/ml. 
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TABLE 4 

16,8hLF 3 Expression Data 

5/13 - High Expressors : 

Mean level of Maximum level of 

5 Mouse expression range expression range 

line No. fug/ml) ( ug/rol) 

27 33.5 97.5 

29 37.5 66.0 

32 21.2 148.0 

10 33 200.0 708.0 

38 25.0 126.0 

8/13 - Low Expressors : 

0.0-1.7 0.2-18 



The data in Tables 3 and 4 demonstrates that the hybrid 
15 intron + heterologous splice acceptor site dramatically 
increases expression levels in a significant number of 
cases (5/13) . 

The construct 16,8hLF4 is expressed at high level (in 
same range as 16,8hLF3). However, (in mice) this only 
20 occurs in a small number of cases and 1/16 when 8 / 8hLF4 
and 26,8hLF4 are included). Similar results were 
obtained using a hSA cDNA. 

Briefly, the 16,8hSA4 transgene was constructed by 
digesting pl6 f 8hLF4 with Clal and Sail to remove the hLF 

25 cDNA sequence. hSA cDNA was excised from a clone with 
EcoRI. A Clal synthetic linker was added to the 5 1 
(upstream) end and a Sail linker to the 3 1 (downstream) 
end. After insertion into the Clal/Sall digested 
16,8hLF4 vector, 16,8hSA4 was formed from which the NotI 

30 insert was excised and used for microinjection. 

The 16,8hSA4 construct yield 9 lines. One of the 9 
lines gave high level expression (100 /ig/ml) ) , while the 



%3 
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remaining 8 of 9 gave low expression (0.01-0.05 /ig/ml) . 



expression in the mouse mammary gland are not determined 
by the particular cDNA used, but are an inherent 
5 characteristic of the 16 , 8X4 construct (i.e., the 16 kb 
5* and 8 kb 3 • flanking regions of the a-Sl casein gene 
combined with the heterologous IVS) . 

The data also show that 0.7 kb of 5' a-51 casein 
flanking sequence does not drive high level expression 
10 and that 8 (6.2), 16 (14.5) and 2 6 kb are more 
effective. In this respect, 8 kb . is slightly more 
effective than 16 or 26 kb of 5' flanking sequence. 

Also, RNA analysis has shown that expression of the cDNA 
constructs is tissue-specific and stage-specific (i.e., 
15 expression is only observed in the lactating mammary 
gland) , that the transcripts are correctly sized and 
that RNA and protein levels correlate. 



20 Transgenesis in the bovine system was obtained utilizing 
the pl6,8hLF4 transgene described in Example 13. 



Bovine oocytes were collected by aspiration of follicles 
present on ovaries obtained from slaughterhouses and 

25 transported in an insulated container at 30-32 °C. 
Oocytes, together with follicular fluid, were aspirated 
from 2-8 mm diameter follicles and pooled into 50 ml 
conical tubes. Cumulus-oocyte complexes (COC) were 
allowed to settle into a pellet, after which the 

30 supernatant was discarded and the pellet washed in 50 
ml TL-Hepes (Vander Shaws, et al. (1991) Therioqenolocrv 



This indicates that the level and the frequency of hLF 
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35 , 288 (Abstr.)- COC, containing several intact, 
unexpanded cumulus cell layers, were selected and 
isolated under a dissecting microscope at 15 x 
magnification, washed four times in 10 ml TL-Hepes, once 
5 in 2-3 ml TCM199+10% fetal calf serum (M199) and then 
paraffin oil (20 COC/droplet) . COC were incubated for 
23 h in a humidified atmosphere of 5% C0 2 in air at 39 °C. 

A total of about 2500 oocytes were used. On average, 
two aspiration sessions occurred per week. The yield 

10 of aspirated oocytes was highly variable from day to 
day, with a mean daily number of about 150. Maturation 
and fertilization were analyzed by cytological analysis. 
Maturation was defined as the breakdown of the nuclear 
membrane, the appearance of the first polar body and a 

15 metaphase plate. Oocytes were fertilized in vitro with 
frozen thawed-sperm obtained from three different bulls 
with excellent characteristics with respect to genetic 
background, field performance and ease of calving. 
Sperm capacitation was facilitated with heparin. 

20 Parrish, J. et al. (1986) Theriocrenoloav 25 : 591-600 . 
Since sperm from individual bulls respond differently 
to specific fertilization conditions , semen from each 
lot was tested in advance to determine optimal heparin 
and sperm concentration required to maximize normal 

25 fertilization frequency and to minimize polyspermy. 
Fertilization conditions for a given bull were selected 
after screening at heparin concentrations of 0.0, 1.0 
and 10.0 mg heparin/ml, and at 1.0, 2.0 and 4.0 x 10 6 
motile sperm/ml. Since the proportion of sperm that 

30 survives freezing and thawing varies from bull to bull 
(approximately 30-60% for the bulls was used here) sperm 
preparations were enriched for live, motile sperm by a 
"swim-up" procedure (Parrish, J. et al. Ibid ) , 
alternatively, sperm were centrifuged through a percoll 

35 gradient. After isolation of the motile portion, sperm 
were counted on a hemocytometer , diluted to an 
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appropriate concentration to yield a 25-fold 
concentrated stock. The fertilization medium consisted 
of TALP medium (Banister, Bethal Biol. Reprod. 28.: 235- 
2 47) supplemented with 2.0-10.0 mg/ml heparin (from 
5 porcine intestinal mucosa, 177 IU/mg; Sigma) and if the 
cumulus was removed prior to fertilization, ImM 
hypotaurine, lOmM penicillamine, 2 0mM epinephrine and 
2mM sodium metabisulf ite . Matured COC were selected on 
the basis of expanded cumulus masses for fertilization, 

10 washed once in 10 ml fertilization medium, and either 
added directly to fertilization droplets, or first 
stripped of their cumulus investment by gentle pipetting 
through a small-bore, fire-polished pipet and then added 
to the droplets. Finally, sperm cells were added to a 

15 final concentration of 1 x 10 6 -2.0 x 10 6 /ml. After 16-24 
h, presumptive zygotes were removed from fertilization 
droplets. At this point, 2 0-3 0 zygotes for each 
experiment were fixed in 3 : 1 ethanol : acetic acid for 24 
h, stained with 1% aceto-orcein (in 40% acetic acid), 

20 and examined to determine fertilization frequency 
(percentage of sample with 2 pronuclei and a sperm 
tail) . For each batch of semen, the 1 in vitro' 
fertilization conditions (heparin concentration and 
sperm number) were optimized to obtain normal 

25 fertilization rates ranging from 50 to 70% as determined 
by the presence of two pronuclei and a sperm tail as 
described above. Either one of two techniques were used 
for selection of motile sperm: the swim-up technique 
and centrifugation through a Percoll gradient. No 

30 significant differences in fertilization rates between 
these methods were recorded. The efficiencies of these 
and the following steps are shown in Table 5. The 
remaining oocyte were then prepared for microinjection. 



TABLE 5 



35 



Efficiencies of the steps involved in the process 
from immature bovine oocytes to transgenic calves 
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step 


Total No. 


* 

Percent 


oocytes 


2470 


_ 


matured 


2297 


93 


fertilized 


1358 


61 


injected 


1154 


85 


survival 


981 


85 


cleavage 


687 


70 


transferred 


129* 


1.9 


pregnant 


21 


2 . 1 


integration 


2 


1.0 


* 

Percentages indicate the proportion of embryos or cells 
that successfully complete each step. 

*Sixty-nine transfers of single blastocytes resulting in 
7 pregnancies; 3 0 transfers of twinned embryos, 
resulting in 14 pregnancies. 



10 



15 



Microinjection. The 2 6 kbp casein-hLF transgene (from 
p 16 , 8hLF4 ) used for microinjection was released by NotI 
digestion and purified by agarose gel electrophoresis 
and electroelution. The final DNA concentration was 

20 adjusted to 2.5 /ig/ml. Batches of 50 cumulus-intact 
fertilized oocytes were stripped either as described 
above or by vortexing 2 minutes in 2ml TL-hepes medium 
in a 10ml conical tube. In order to visualize the 
pronuclei, cumulus free oocytes were centrifuged in 1ml 

25 TL-hepes medium 8 minutes at 14,500 x g in an Eppendorf 
centrifuge. Wall, R. et al. Biol. Reprod. 32 : 645-651. 
Microinjection was performed essentially as described 
by Hogan B. et al. (1985) Manipulating the Mouse 
Embryo: A Laboratory Manual, Cold Spring Harbor 

30 Laboratory Press, Cold Spring Harbor, N.Y.. 



Embryo culture. Embryos were cultured from the zygote 
to the compact morula or blastocyst stage in oviductal- 
tissue conditioned medium. Eyestone et al. , (1991) J. 
Reor od . Fert . 92:59-64. Oviducts were obtained at 
35 slaughter and transported at ambient temperature. 
Luminal tissue from 2-4 oviducts (1-2 cows) was 
harvested by gently scraping intact oviducts on the 
outside with a glass slide. The extruded material was 
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washed 5 times in 10 ml TALP-Hepes and diluted in M199 
to a tissue: media ratio of 1:50. Media were conditioned 
in 50 ml M T" flasks containing five ml of oviduct tissue 
suspension. Conditioned media frequently contained a 
5 proteinaceous precipitate after thawing, which was 
removed by centrifugation. Droplets were covered with 
paraffin oil and were incubated for 2 h to- permit pH to 
equilibrate prior to adding zygotes. Zygotes were 
placed in culture droplets within 2 h after 
10 microinjection. Initial cleavage (>2 cells) was 
assessed 42 h after adding sperm. Media were not 
changed during the course of incubations. Criteria for 
normal development consisted of attainment of the 
compact morula or blastocyst stage. 

15 Embryo transfer. The synchronization schedule was set 
up so that recipients started estrous on the same day 
at which oocytes were aspirated from slaughterhouse 
ovaries (i.e., start of maturation is day 1). Estrous 
in recipient cattle was synchronized with a 9-day 

20 Norgestamet (Intervert, Boxmeer, The Netherlands) 
treatment (administered in an ear implant according to 
the manufacturer) , and a 500 /ig dose of cloprostanol 
given on day 7 of the Norgestamet treatment. Estrous 
occurred within 2-3 days after implant removal. Embryos 

25 were transferred non-surgically to recipient heifers 5-7 
days after estrous (1-2 embryos /uterine horn) . 
Recipients received 9-day old embryos, at which time 
they have developed to the compact morula or early 
blastocyst stage. These embryos are one day ahead in 

30 development compared to the stage of the estrous cycle 
of the recipients. In case of two microinjection 
sessions on subsequent days, one group of recipients was 
used that were in synchrony with the first batch of 
oocytes collected. Transfers of embryos that developed 

35 from oocytes aspirated on the day of the start of 
estrous gave better results than embryos from oocytes 
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obtained one day later. Due to the somewhat delayed 
development of microinjected embryos, there appeared to 
be a better synchrony between the recipients and the 
first group of embryos. Recipients received two embryos 
5 when the quality grade (according to Linder and Wright, 
Therioaenoloay 20:407-416) was fair to poor and only one 
single embryo when the quality grade was excellent to 
good. Each pregnant recipient that received 2 embryos 
carried only one fetus to term. The overall pregnancy 
10 rate was 21%, which is significantly less than the rates 
reported by others with non-microinjected embryos which 
had developed in vivo (Linder and Wright, Ibid and Massy 
et al. (1984) Theriogenolocrv 21^:196-217). In the 
experiments described here, no transfers with non- 
15 injected embryos were performed. 

Pregnancy was determined by rectal palpation at 45 to 
60 days of gestation. A total of 21 pregnancies were 
established (confirmed by rectal palpation 45-460 days 
after transfer) . During pregnancy, 2 fetuses were lost. 

20 One recipient aborted spontaneously for unknown reasons 
at 7.5 months of gestation. The second fetus, collected 
at slaughter of the recipient at 3 weeks after the 
calculated day of parturition, was a full grown dead 
"calf having an abnormal embryonic development called 

25 'schistosoma reflexum' . In both cases, no intact DNA 
could be isolated for analysis. Nineteen calves were 
born after normal pregnancies. One of these calves died 
during parturition, and a second, 24 hours after birth, 
because of pneumonia following accidental inhalation of 

30 milk. A third calf, born after a pregnancy of 10 months 
and with a body weight of 7 0kg was euthanized at an age 
of 3 weeks. Pathological analysis indicated that the 
animal was suffering from sepsis due to chronic 
omphalophlebitis. Tissues that could be analyzed from 

35 the three dead calves contained no integrated human 
lactoferrin (hLF) sequences. Therefore, the cause of 
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their death is unlikely to be related to transgene 
integration. The remaining 16 calves are in excellent- 
health. 

Structure of the transgene. In Fig. 12 A, the coding 
s sequence of the hLF cDNA is depicted by a 'hatched box. 
The position of the translational start and stop codon 
is indicated. The 5' and 3' untranslated regions are 
encoded by aSl casein exons (open boxes) . Intervening 
sequences interrupting these exons are represented by 

10 a single line. The expression unit is surrounded by 
flanking sequences derived from the bovine aSl casein 
gene (indicated by a double line) . Positions of 
restriction enzyme sites are indicated by the following 
symbols: R, EcoRI; A, Asp718; N, Notl. The NotI sites 

15 are not present at the indicated positions in the bovine 
aSl casein gene itself, but were introduced by synthetic 
linkers. The black bar represents the position of the 
probe used to detect the presence of the transgene. 
Sizes of the fragments (in kbp) obtained after digestion 

20 with EcoRI or Asp718 are shown at the bottom. 

DNA analysis. DNA was isolated from placenta, blood and 
ear tissue from all calves. A Southern blot analysis 
of the extracted DNA is shown in Fig. 12B. Ten fig of 
DNA was loaded per lane. Fragment size markers are in 

25 kbp (Hindlll digest of lamda DNA) are indicated on the 
left. Lane 1, EcoRI digested human DNA (isolated from 
blood), lane 2, Eco RI digested DNA from calf #4 
isolated from blood; lane 3, Asp718 digested DNA from 
calf #4 isolated from blood; lane 4, EcoRI digested 

30 placental DNA from calf #4; lane 5, Asp718 digested 
placental DNA from calf #4; lane 6, EcoRI digested DNA 
from calf #15 isolated from blood; lane 7, Asp718 
digested DNA from calf #15 isolated from blood; lane 8, 
EcoRI digested DNA from calf #15 isolated from ear 

35 tissue; lane 9, Asp718 digested DNA from calf #15 
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isolated from ear tissue; lane 10, EcoRI digested 
placental DNA from calf #15; lane 11 , Asp718 digested 
placental DNA from calf #15; lane 12, EcoRI digested DNA 
isolated from the tail of a transgenic mouse harboring 
5 the same construct, DNA extraction, Southern blot 
analysis and hybridization were performed according to 
standard procedures. The probe used in the Southern 
blotting experiment was a 758 bp EcoRV-EcoRI fragment 
covering the 3 * part of the hLF cDNA of Figure 2 . 

10 Southern blot analysis using hLF cDNA as a probe 
indicated that in tissues of two calves (#4 and #15) 
transgene sequences had been integrated into the host 
genome. Calf #15 (a female) was mosaic for integration 
of the transgene. Placental tissue was positive, 

15 whereas in blood and ear tissue no hLF sequences could 
be detected. The copy number in the placenta was 1-2. 
The restriction enzyme map of the transgene was 
different from that expected based on the map of the 
casein-hLF plasmid (FIG. 12A) and based on the pattern 

20 obtained in many individual transgenic mice (data not 
shown) . Apparently, a rearrangement had occurred 
involving a deletion of part of the DNA construct. It 
is not clear whether this rearrangement event is related 
to the fact that the transgene could not be detected in 
.25 all tissues. In mice, it has been shown that over 3 0% 
of all transgenic animals born are mosaic. 

Calf #4 (a male) showed, in all three tissues, the same 
hybridization pattern that was identical to the expected 
one. Restriction digestions with different enzymes 

30 indicated that head-to-tail concatamers of intact copies 
had integrated and there was no indication of 
rearrangements. Copy numbers were estimated by 
comparing the intensities of the transgenic band with 
bands resulting from hybridization of the hLF probe to 

35 human DNA (FIG. 12B) . In calf #4, between 5 and 10 




-91- 

copies of the transgene had integrated in all three 
tissues examined. 

An analysis of sperm producted by calf #4 detected no 
abnormalities. DNA was subsequently isolated from the 
5 sperm and analyzed for the presence of the hLF- 
transgene. It appeared that the copynumber of the 
transgene (2-3) was the same in sperm as in other 
tissues indicating that calf #4 is not mosaic and should 
be able to transmit the transgene to 50% of his 
10 offspring. 



EXAMPLE 16 

Construction of Transgene Cassette 
for Genomic Recombinant DNA 



The plasmids described so far all contain regions 
5 derived from the bovine aSl-casein untranscribed regions 
(including intervening sequences) . When a genomic gene 
is to be expressed that already contains untranslated 
regions and intervening sequences permissive for high 
expression, it is preferable to use expression cassettes 
10 where the flanking regions including the transcription 
initiation site of the ceSl casein gene are operably 
linked to the untranslated regions of the gene to be 
expressed. Such an expression cassette is p-16 kb,CS 
and was constructed as follows: plasmid pSl 3'5'hLF was 
15 used as a template in a PCR experiment. This plasmid 
contains 680 bp of promoter sequence of the aSl casein 
gene as well as its first exon. the rest of this 
plasmid is not relevant for this experiment. The 
upstream primer was located just upstream of the insert 
20 in the plasmid moiety (just upstream of a NotI 
restriction site) . Its sequence i^: S^CGA CGT TGT AAA 
ACG ACGG-3 1 . 
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The downstream primer was located in exon 1. Its 
sequence matches the first 19 bp of the exon exactly and 
also has a non-hydridizing region of 17 bp contaii 



a Clal and a Sail site. It has the following sequence; 
5 5 ' -ATTGTCGACTTATCGATGGGTTGATGATCAAGGTGA-3 ■ 

The amplified fragment was digested with NotI and Sail 
and ligated into pKUN2 (see Example 13) . The resulting 
plasmid (p-68 0CS) therefore harbors a proximal promoter 
fragment from -680 to + 19, plus two restriction sites 
10 just downstream of those 19 bp. 

This plasmid was digested with NotI (just upstream of - 
680) and Nsil (at-280) and used as a vector to ligate 
to a fragment running from a NotI site (just upstream 
of -16 kb) to Nsil (-280) isolated from pl6,8hLF4 

15 (Example 13). This plasmid (p-16 kb,CS) therefore 
harbors a promoter fragment from «-16,000 to +19. It 
can be used to insert genomic genes that carry their own 
UTR's and poly (A) -signal . After insertion of the 
genomic gene as a Clal-Sall fragment, the aSl casein 3'- 

20 flanking region can be inserted as a Sall-f ragment . 



EXAMPLE 17 

Construction of Transgene 
for Production of Protein C 



The genomic sequence of Protein C has been published. 

25 Foster, et al. (1985) Proc. Natl. Acad. Sci. USA 82, 
4673-4677. This sequence, however, does not include the 
first exon which was identified through the cDNA 
sequence published by Beckman, et al. (1985) Nucl. Acids 
Res. 13 f 5233-5247. The first exon of Protein C is 

30 located at position -1499 to -1448 in the Foster 
sequence. The transgene for expressing and secreting 
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Protein C into the milk of bovine species is shown in 
Fig, 10. This transgene was constructed as follows. 

A human genomic library in EMBL-3 (Clonotech) is probed 
with a sequence specific for protein C. A purified 
5 phage DNA prep containing the complete Protein C gene 
is isolated. The phage is isolated from an Ejl. coli 
strain having the Dam phenotype, such a strain GM113 . 
This results in cloned DNA which is not methylated and 
as such all Clal restriction sites can be cleaved. 

10 A Clal Nhel fragment running from positions +13 3 3 to 
11483 is isolated. This is designated fragment I. 

PGEM7 (Stratogene, Inc.) is digested with SphI and Smal. 
The region in between is replaced by the corresponding 
region of plasmid pKUN ( Gene (1986) 46, 269-276) . The 
15 resulting plasmid is designated pGEM7A and has the 
following restriction map in the relevant region: 


Hindlll Clal Xbal Sail Spel 



20 Two primers are , synthesized. Primer GP125 has the 
following sequenbe^S JTI?^ ^) 

5 ■ - CAA ATC GAT TGA ACT TGC AGT ATC TCC ACG AC - 3 ' 
Clal 

Primer GP 12 6 has the following sequenc^^^^^ 

25 5 1 - GGG ATC GAT CAG ATT CTG TCC CCC AT - 3 1 

Clal 

Primer GP125 has an overlap with exon O (position 654 
to 675 of the Protein C gene) and introduces a Clal site 
in the 5' untranslated region. Exon 0 is the exon not 



oA 
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identified by Foster, et al. Primer GP12 6 overlaps the 
region from 13 44 to 1315 in the Protein C gene. This 
region contains a Clal site. 

The region between position 654 and 1344 is amplified 
using either human DNA or phage DNA as a template. The 
so amplified material is digested with Clal and cloned 
in vector pGEN7a to form pPCCC. This vector is 
propagated in a dam negative strain such as GM113 and 
partially cut with Clal (only the plasmids that are cut 
once with Clal at position 1340 are of interest) and 
completely with Xbal. The Clal Nhel fragment (fragment 
1) is cloned into this vector. The resultant plasmid 
is designated pPC. Its structure is shown in Fig. 10. 
From this plasmid, the Protein C transgene is isolated 
as a Clal-Sall fragment and ligated into pl6 kb, CS (See 
Example 15) to generate a transgene capable of 
expressing Protein C in bovine milk, this plasmid is 
designated pl6 kb,CS,PC. 

The transgene contained within plasmid p 16 kb, CS, PC 
2 0 is excised with NotI and used to generate transgenic 
bovine species as previously described. Such transgenic 
animals are capable of producing protein C in their 
milk. 



EXAMPLE 18 

25 Human Lactoferrin Transgene Formed by 

In Vivo Homologous Recombination: 
Microinjection of Two Overlapping DNA Fragments 

To obtain the entire hLF genomic clone, two human 
genomic cosmid libraries were screened using an hLF cDNA 
3 0 clone described herein as a probe. Of 14 clones 
isolated, 2 clones (designated 13.1 and 13.2; one from 
each human cosmid library) contained the entire hLF gene 




10 
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10 



15 



20 



25 



as determined by hybridization with primers specific for 
the first and last (17th) hLF exons. and by DNA 
sequencing. The insert sizes of these hLF genomic 
clones was 42 kbp for clone 13.1 and 43 kbp for clone 
13 . 2 . Clones 13 . 1 and 13 . 2 contain 5 kbp and 13 kbp of 
5 f flanking sequences, respectively. The 3' flanking 
region of clone 13.2 is between 1 kbp and 3 kbp; clone 
13.1 contains 7 kbp of additional 3 1 flanking sequence. 
The size of the structural hLF gene ( = introns + exons) 
is approximately 3 0 kb. 

The identity of the hLF clones was confirmed by 
sequencing several exons (incl. first and last) and 
comparing these sequences and the promoter region to the 
hLF cDNA sequence shown in Fig. 2. In addition, the 
clones were transfected into human kidney 293 cells and 
hLF expression was detected, indicating that both clones 
were functional. 

A comparison of the 13.1 and 13.2 clones (derived from 
independent libraries) by restriction mapping and 
Southern blotting revealed no differences in the 
corresponding regions (i.e. in the structural hLF gene) . 
Southern blotting experiments revealed that the hLF gene 
is a single copy gene in the human genome. Figs. 13-16 
illustrate the overall procedure for generating the aSl 
casein/genomic hLF transgene. 

The most 5 1 Apal site in the structural hLF gene is 
located in exon I, in the hLF signal sequence. The 400 
bp region immediately 5' of exon I was sequenced. This 
region contains the transcription initiation site of the 
hLF gene and a TATA-box. This region also includes a 
BamHI restriction site. 

To construct a mammary gland specific expression vector 
it was necessary to fuse the 8 (6.2) kbp or 16 (14.5) 
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kbp aSl bovine casein promoter region to the genomic hLF 
clone. However, the total size of such a construct, 
about 50 or 60 kb (6.2 or 14,5 kb from the casein gene 
promoter + 8 kb from the cosraid vector and 3 5-4 0 kb from 
5 the hLF genomic clone, i.e., about 50-63 kb) , renders 
the use of conventional cloning vectors difficult. 
Therefore, the 8 kbp or 16 kbp aSl 5' casein promoter 
and flanking sequence was fused to 9 kb of the 5 1 region 
of the structural hLF gene (Fig. ISA) and this fragment 
10 was coinjected with an overlapping hLF fragment 
containing about 33 to 34 kbp of the 3 1 sequence of 
generic hLF clone 13.1 obtained by Clal digestion. See 
Figs. 13 and 16. 

The BamHI fragment (containing exon I) from clone 13.2 

15 was subcloned into the plasmid pUC19 (Fig. 14) . From 
this clone, a 8.9 kbp Apal-Sall fragment was isolated 
by Apal (partial digest) and Sail digestion. This 
fragment lacks most of the hLF signal sequence and all 
of the hLF 5' UTR. A synthetic sequence (Fig. 15B) 

20 representing this missing region was obtained by 
synthesizing 2 complementary DNA strands (a 68-mer and 
a 62-mer) which runs from the 5' Apal site into the 
downstream region from the hLF TATA-box. After 
annealing these primers a DNA fragment was generated 

25 which has a 5' Clal overhang and a 3' Apal overhang. 
Subsequent sequencing of the cbl-ApcI fragment showed 
that it has the sequence given in Fig. 15B, which 
differs at one position from the native sequence. This 
synthetic Clal-Apal fragment and the 8 . 9 kbp Apal-Sall 

30 fragment described above were ligated into p-16kbCS and 
into a similar plasmid, containing 8 kbp instead of 16 
kbp of the aSl casein promoter. This yields two 
plasmids, containing 16 kbp or 8 kbp of bovine aSl 
casein promoter, fused to the 5' part (9 kbp) of the hLF 

3 5 genomic gene. See Fig. 15A. These fragments were cut 
out (Notl-Sall) and coinjected with the 3' 33 to 34 kbp 
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Clal fragment from hLF cosmid clone 13.1. 
coinjected fragments had an overlap of 5.4 kbp. 



The 



Upon coin ject ion of the constructs containing the 8 kbp 
aSl casein promoter, 8 independent transgenic mice were 
5 identified by tail-DNA blotting. To determine if 
homologous recombination had occurred, chromosomal DNA 
(from tails of founders and offspring) was digested with 
Apal and analyzed by Southern blotting. The 2.7 kb 
Clal-Mlul fragment (see Fig. 13 or Fig. 17) that is 

10 located in the overlap was used as a probe. When 
homologous recombination occurred, a band of 7.5 + 0*3 
= 7.8 kb is generated and detected with this probe. 
This band is also present in human chromosomal DNA, 
which was used as a control in the analysis. If 

15 homologous recombination has not occurred, the probe 
detects bands of varying size, depending on the location 
of Apal sites around the site of integration. 

The diagnostic 7.8 kb band was detected in all 8 
transgenic mouse lines, indicating that each transgenic 
20 mouse contained recombined fragments. For these 8 mouse 
lines (founder no's: 936, 937, 950, 951, 982, 983, 984 
and 985) , milk was collected from lactating females 
(founder and/or offspring) and assayed for hLF protein 
expression. The data on 7 mouse lines is shown below. 
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Mouse line 

936 

937 
950 
951 
982 
983 



10 984 
985 



Expression Level 
(max.) (mcrhLF/ml) 

4.5 

6.0 
0.003 
0.010 
5*9 

similar to 982 and 937 on 



day 2 and 4 of lactation* 



2.8 
6.6 



*: This mouse died (by accident) on day 4 of lactation. 
At this time, hLF expression had reached a level of 0.3 
mghLF/ml. This is exactly the level found for other 

15 high expressors (e.g. lines 937, 982, 984) at this early 
stage of lactation. This phenomenon of gradual increase 
of hLF expression at the beginning of, and in 
particular, the first lactation has been commonly 
observed by us in the mice generated herein. Therefore, 

20 mouse 983 is classified as a high level expressor. 



The tissue-specificity of hLF expression was determined 
by isolating total RNA from a large number of tissues 
and analyzing for the presence and levels of transgene 
derived mRNA. Based on this analysis, hLF mRNA only 
25 occurs in the lactating mammary gland and expression is 
tissue- and stage-specific. 

RNA levels were below the threshold of detection in 
lines 950 and 951, but were high in high expressing 
lines and correlated with bovine aSl casein expression 
3 0 levels. This was determined by Northern blot analysis 
of both bovine lactating mammary gland RNA and mammary 
gland RNA from lactating transgenic mice. A 24 bp 
synthetic oligomer which hybridizes to exactly the same 
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sequence in the 5' UTR of bovine aSl casein RNA and in 
the transgene RNA was used as a probe. Expression 
levels were compared directly by quantification of the 
amount of labelled probe hybridized to the transgene- 
5 and aSl RNA. When a correction was made for the size 
difference between bovine aSl casein (20 kD) and hLF (80 
kD) , the ratio of mRNA to protein was in the same range 
for bovine aSl casein and hLF ♦ This indicates that 
translation and secretion of the transgene derived hLF 

10 is not impaired. The length of the hLF mRNA was as 
expected (about 2.5 kb) but in mouse line 937 a longer 
band (3-3.5 kb) of slightly less intensity was also 
observed. The occurrence of this band may. be related 
to the homologous recombination process. It remains to 

15 be determined if this RNA translates into bona fine hHL. 

It has been suggested that casein promoters are less 
favorable for obtaining high level expression than other 
milk specific promoters. However, the present data show 
that this is not the case. With respect to both 
20 expression level and percentage of expressing animals, 
the transgenes containing aSl casein sequences perform 
better than any other mammary gland specific transgene 
reported . 

The above data compared to those obtained with 
25 constructs containing hLF cDNA provide the following 
observation. The best cDNA expression vector herein 
(16,8hLF3) always expresses at much lower levels as 
compared to the genomic hLF construct. Of 13 cDNA lines 
generated, 8 expressed at very low levels (1-5 /ig/ml) , 
30 5 expressed from 40 to 200 /ig/ml. These relatively low 
levels (although high for cDNA expression) as compared 
to that observed for genomic hLF (containing the same 
flanking sequences) indicate that genomic sequences 
produce consistently higher expression levels. 
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EXAMPLE 19 

Generation of Genomic Human Lactoferrin Transgenes 
bv Conventional Cosmid Ligation Techniques 

hLF genomic transgenes have also been generated by 
5 conventional ligations in cosmids. The first construct 
ShLFgen is similar to the transgene generated by 
coinjection, but contains the 3' Clal fragment from hLF 
clone 13.2. The size of this fragment is about 26- 
27 kb. The second construct 16hLFgen is identical to 
10 ShLFgen, but contains a larger stretch of aSl casein 
promoter sequences. 

Construction detail : 

The Notl-Mlul fragment from the construct depicted in 
Fig. ISA (referred to as 8hLFgen9k) was used to prepare 

15 the ShLFgen construct. This Notl-Mlul fragment contains 
the synthetic Clal-Apal fragment depicted in Fig. 15B. 
This synthetic sequence contains 24 bp of the hLF S^UTR 
and encodes for most of the hLF signal sequence (see 
Fig. 15C) . This Notl-Mlul was ligated with the 3 f Mlul- 

20 Clal fragment from clone 13.2 and a Clal-NotI linker as 
shown in Fig. 17. The cloning vector was cosmid pWE15 
cut with NotI, from which the internal Clal and Sail 
sites had been deleted. 

The first intron of the hLF gene is located 4 bp 
25 downstream of the Apal site in the signal sequence. As 
a result, the DNA sequence encoding the 19 aa signal 
sequence is partly located in exon 1 (43 bp, encoding 
14 aa and 1 codon partially) and in exon 2 (the first 
14 bp, encoding 4 aa and 1 codon partially) . The exact 
3 0 position of hLF intron 1 was determined by DNA 
sequencing and comparing the genomic sequence to the hLF 
cDNA sequence. The sequence upstream of the translation 
initiation site (355 bp, containing the hLF 5 1 UTR and 
5' flanking region) was also sequenced. 
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The hLF transcription initiation site was not included 
in the genomic hLF constructs as shown. Instead, they 
contain the bovine aSl casein gene transcription 
initiation site. Although the exact position of the hLF 
5 'cap' site has not been determined, it is probably 
located about 3 0 bp downstream of the 'TATA' box, as is 
the case for the vast majority of eukaryotic genes. In 
addition, for the mouse LF gene the transcription 
initiation site has been mapped (Shirsat, et al . (1992) 
10 Gene 110, 229-234; Liu and Teng (1991) J. of Biol. Chem . 
32 . 21880-21885) . On the basis of homology between the 
mLF and hLF 5 1 UTR, it is concluded that genomic hLF 
constructs herein do not contain the hLF transcription 
initiation site. 

15 The cDNA contains a Thr codon (ACA) at aa position 13 0 
(see Fig. 2) . The corresponding region in genomic hLF 
clones 13.1 and 13.2 (exon 4, plus parts of intron 3 and 
4) have been sequenced. These clones contain the 
sequence ATA, which encodes isoleucine. The cDNA also 

20 contains a Cys codon (TGC) at position 404 (see Fig. 2) . 
In hLF clones 13.1 and 13.2 this is a GGC, encoding 
glycine. 

By using the Notl-Mlul fragment from 16hLFgen9k instead 
of from 8hLFgen9k, 16hLFgen was generated. ( 

25 Construction of 8hLFgen37 : 

The 5' Notl-Mlul fragment from the construct 
depicted in Fig. 15A (called 8hLFgen9k) was ligated to 
the 3 1 Mlul-Clal fragment from clone 13.1, combined with 
a Clal-NotI linker (compare Fig. 17: read 13.1 instead 
30 of 13.2). The cloning vector was cosmid pWE15, from 
which the internal Clal and Sail sites had been deleted, 
cut with Notl. Prior to microinjection, vector 
sequences were removed via Notl digestion. 
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All constructs were cut from the vector using NotI, and 
microinjected. 

Expression data : 

Three mice containing ShLFgen and 5 mice containing 
5 16hLFgen were generated. Preliminary expression date 
in milk are as follows: 



-pi 



10 



Construct 


Line 


Max. hLF expression 
in milk fma/ml} 








8hLFgen 


1089 


0.95 




1252 


1.2 




14 01 


1.4 


16 hLFgen 


1112 


2.8 




1113 


ND 




1134 


0.3 




1185 


ND 




1191 


ND 


8hLF37 


1507 


4.1 




1556 


8.7 



ND = not done 
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EXAMPLE 2 0 
Bovine fiLG/Human Lactoferrin Transaenes 

The bovine /3LG-promoter (beta-lactoglobulin) was used 
to construct a transgene encoding for the expression of 
5 hLF. Briefly, the aSl promoter in the genomic hLF 
constructs 8hLFgen and 8hLFgen3 7 were replaced with the 
bovine )3LG-pr omoter . The resulting constructs are 
referred to as jSLG-hLFgen and /SLG-hLFgen37 . The overall 
strategy for these constructions are depicted in Figs. 
10 18-20, 

Isolation of the bovine gLG-promoter . 

The charon 2 8 phage clone X/3LG-13 , described by Silva 
et al., (1990) NucL Acids Res, 18 :3 051, was obtained 
from Dr. Carl A. Batt. This clone was isolated from a 
15 bovine genomic library by screening with a 0LG cDNA 
probe. It contains the structural )SLG gene and about 
8 kb of 5' flanking region. From this clone, a 4 . 3 kb 
EcoRI fragment was isolated and subcloned into plasmid 
pKUN5 using standard procedures (see Fig. 18) . 

2 0 From this plasmid, a 3.2 kb Notl-SacI fragment was 

isolated. The NotI site was derived from the poly linker 
of the cloning vector. The SacI site lies 15 bp 
downstream of the BLG transcription initiation site. 
A PvuII site is located five bp upstream of the 
25 translation initiation site. A fragment representing 
the region between the SacI and PvuII sites (including 
these sites) was generated by synthesizing and annealing 
the 3 0-mer and 37-mer DNA oligomers depicted in Fig. 18. 
This fragment also contains a Clal and a Sail site 

3 0 directly downstream of the PvuII site (Fig. 18) . The 

3.2 kb Notl-SacI fragment and the synthetic Sacl-Sall 
fragment were ligated into a pKUN plasmid (pKUNldeltaC) , 
from which the internal Clal site had previously been 
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removed by cutting with Clal and subsequent treatment 
of the cut vector with Klenow enzyme. This ligation 
resulted in plasmid pBLG3 . 2 * 

The 7 34 bp region directly upstream of the translation 
5 initiation site was sequenced and compared to the 
corresponding region of the published sequence of the 
sheep BLG promoter (see Fig. 24) . Overall homology was 
91%, indicating that the sheep- and bovine BLG-promoters 
are very similar. 

10 Generation of flLG-hLF constructs : 

The 8.9 Clal-Sall fragment from construct 8hLFgen9k 
(Example ISA) was isolated and cloned into p/3LG3 . 2 after 
cutting this vector with Clal and Sail. This ligation 
resulted in construct p)SLGhLFgen9k (Fig. 19) . From this 

15 construct the 9.4 kb Notl-Mlul fragment was isolated 
and, together with the 23-24 kb MluI-NotI fragment 
isolated from 8hLFgen, ligated into a NotI cut pWE15 
cosmid, resulting in p/3LG-hLFgen (Fig. 19) . The 34 kb 
NotI insert was isolated from the cosmid by NotI 

2 0 digestion and microinjected following standard 

procedures . 

For the generation of pjSLG-hLFgen37 the 9 . 4 kb Notl-Mlul 
fragment from pjSLGhLFgen9k was ligated with the 3 0 kb 
3 1 Mlul-Clal fragment from hLF clone 13.1, combined with 
25 a Clal-NotI linker into a NotI cut pWE15 cosmid vector. 

The £LG-hLFgen insert was isolated and microinjected 
following standard procedure. 

Expression data : . 

0LG-hLFgen (the shorter of the 2 constructs) was 

3 0 injected and 7 independent jc nouse line s were produced. 

Expression data for hLF product in milk is available for 
the following lines: 
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Construct 


Line 


Max, hLF expression 
in milk (ma/ml) 








pjLiL»— nijr gen 


11UO 


0 02 




1 1 n7 


1.9 




1108 

_I_ J- \J o 


0.8 




1110 


6.2 




1111 


1.3 




1155 


2.1 




1156 


2.2 


0LG-hLFgen3 7 


1591 


0.05 




1592 


27 




1593 


5.9 
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EXAMPLE 21 

Isolation of a Genomic hLF Fragment 
Containing Both the Structural Gene 
and the hLF Promoter 



5 HLF is normally expressed at relatively high levels (1- 
2 mg/ml) in human milk. To determine whether the hLF 
promoter can drive high level hLF expression in the milk 
of transgenic animals, the intact hLF gene under control 
of its own promoter was microinjected using standard 
10 procedures. 

Construction details : 

Two important points determined the construction route. 
Since the cosmid vector C2RB (Fig. 13) containing the 
genomic hLF clones does not contain unique restriction 

15 sites flanking the hLF insert, the intact insert could 
not be isolated directly from this cosmid. It was 
desirable to include all 5 1 and 3 f flanking sequences 
present in hLF clones 13.1 and 13.2 into the transgene. 
Since clone 13.2 (Fig. 13) contains the most 5' flanking 

20 sequences (13 kb) and clone 13.1 the most 3 1 flanking 
sequences (7 kb more than 13.2) , the 5' part of 13.2 was 
combined with the 3* part of 13.1. 

The cosmid 13.2 was linearized at the Pvul site 0 . 5 to 
0.8 kb upstream of the 5' region of the hLF insert (Fig. 

25 20) and subsequently treated with the exonuclease Bal31, 
thereby removing approximately 1 kb of cosmid and 0.2 
to 0.5 of 5 1 hLF sequence. Subsequently, the DNA was 
treated with T4 polymerase to create blunt ends and cut 
with Mlul. The approximately 19 kb (12.5 5' flanking 

3 0 sequences + 6.2 kb hLF gene) blunt end-MluI cut plasmid 
vector (pKUN6deltaCla, Smal-Mlul) , resulted in plasmid 
phLFS'M gene 37. This plasmid contains a NotI site 
directly 5' of the Smal site. From this plasmid, the 
19 kb Notl-Mlul fragment was isolated and ligated with 
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th e 30 kb MluI-NotI 3' fragment from construct 8hLFgen37 
into a NotI cut pWE15 cosmid, resulting in p5 , hLFgen37 
(Fig. 20) . 

The 49 kb NotI insert was isolated and microinjected 
5 following standard procedures. 

Expression data on construct p5 1 hlLFaen37 : 

Eight independent founder mice have been generated for 
the p5 1 hLFgen3 7 construct; expression data are available 
for 6 lines. 



10 



15 



Construct 


Line 


Max. hLF expression 
in milk Cma/mll 


p5"hLFaen37 


1491 


ND 




1492 


2.5 




1493 


4.2 




1495 


6.5 




1496 


18 




1497 


ND 




1506 


6.3 




1551 


6.4 



ND = not done 



EXAMPLE 2 2 



Generation of Mammary Gland 
Specific hLZ Expression Cassettes 



The structure and sequence of the human lysozyme gene 
has been described (Peters, et al. (1989) Eur. J. Biochem 
20 1JL2 : 507-516) . The structural hLZ gene contains 4 exons 
and is 5.3 kb in size. 

Using a 91-mer synthetic DNA sequence complementary to 
part of exon 2 of the hLZ gene as a probe, several 
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independent hLZ clones were isolated from a human 
genomic phage library . The clone X7 . 2 . 1 contains 14 kb 
insert comprising 8.7 kb of 5' flanking sequences and 
5-3 kb of the genomic hLZ gene. Exon 4 is only partly 
5 included: clone \7 . 2 . 1 stops at one of the Sau3A sites 
at position 5333 and 5350 (numbering according to 
Peters, et al., op. cit.). The region downstream of 
position 5333/5350 (532 or 549 bp of exon 4 sequences) 
is missing. These sequences are non-coding and 
10 represent part of the 3' UTR of the hLZ gene. All hLZ 
coding sequences are present in \7 . 2 • 1 • 

Expression Vector 16.8hLZ 

The design of expression vector 16,8hLZ, shown in Fig. 
21, is as follows. The 5' flanking region (including 
15 the promoter) of the hLZ gene was removed and replaced 
with the bovine aSl casein gene promoter by subcloning 
into the plasmid p-16kbCS which is described in Example 
16. The fusion site is located in the 5 1 UTR of the hLZ 
gene (exon 1) , such that in addition to 2 3 bp of casein 

2 0 5 1 UTR most of the hLZ 5 1 UTR is present. All coding 

sequences in this construct, including the signal 
sequence, are derived from hLZ clone X7.2.1 (Fig. 23A) . 

The 3* UTR of the hLZ gene in clone X7 . 2 . 1 was fused to 
the 3 'UTR + flanking region of the bovine aSl casein 
25 gene described previously. The resulting 3' UTR of 
construct 16,8hLZ is therefore derived partly from the 
hLZ gene (exon 4, running from bp 4761 to bp 5333/5350) 
and partly (including par of exon 8 and all of exon 9) 
from the bovine aSl casein gene. The 3 ■ flanking region 

3 0 (8 kb) is derived entirely from the bovine aSl casein 

gene . 
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Construction details for 16,8hLZ : 
16,hLZ : 

The 6 bp directly 5 1 to the AUG codon in hLZ exon 1 
constitute a Hindi site, A Sail phage poly linker site 
5 is located directly 3 % of the X7 . 2 . 1 insert. These 
sites were used to isolate a 5.3 kb HincII-Sall insert 
(Fig. 23) . The sequence running from +3 (relative to 
the transcription initiation site at +1) to the Hindi 
site, was synthesized by annealing the 31-mer and 3 5-mer 

10 depicted in Fig. 23A. The resulting synthetic DNA 
fragment has artificial 5' KpnI-HincII fragment and the 
5.3 kb HincII-Sall fragment were subcloned into a Kpnl- 
Sall cut pKUN-1 plasmid (Fig. 2 3A) . From the resulting 
9.3 kb plasmid (pKHLys3 '5.3) the 5.3 kb Clal-Sall 

15 fragment was isolated and subcloned into a Cla-Sall cut 
p-0.7kbCS plasmid (the equivalent of p-16CS but 
containing less 5' flanking sequences), resulting in 
pKhLZ0.7. 

The 8 kb bovine aSl casein gene EcoRI fragment 
20 containing the 3 1 casein UTR and ca 6.6 kb of flanking 
sequences, was isolated from plasmid pKE3 • E10 (described 
previously) as an 8 kb 5 " -XhoI-SalI-3 1 fragment (Fig. 
23B) . This fragment was subcloned into the Sail site 
of pKhLZO.7, resulting in p0.7,8hLZ. After this, the 
25 Sail site of p0.7,8hLZ was replaced with a NotI site by 
insertion of linker S1/S2 (Fig. 23C) , yielding plasmid 
p0.7,8hLZNt (Fig. 23D) . From this plasmid, the 13.3 kb 
Clal-NotI fragment was isolated and ligated with the 
14.5 kn Notl-Clal fragment from p-16CS into a NotI cut 
3 0 pWE15 cosmid (Fig. 2 3E) . From the resulting construct 
(named 16,8hLZ in Fig. 23E) the 27.8 kb NotI insert was 
isolated, purified and microinjected into murine and 
bovine zygotes following standard procedures. 




# 
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Ex pression Vector 16,8hLZ3 

The design of expression vector 16,8hLZ3, shown in Fig. 
22, is as follows . Previously described expression 
vector 16,8hLF3 was used in the construction of 
5 16,8hLZ3. The vector 16,8hLZ3 contains not only the 
bovine oeSl casein gene promoter, but also the complete 
first exon and part of the first intron of the bovine 
aSl gene. In addition, it contains part of the first 
intron plus the splice acceptor site of an 

10 immunoglobulin gene. The signal sequence and part of 
the 3 1 UTR and the complete 3 • flanking region are also 
derived from the bovine aSl casein gene. The hLF cDNA 
and the aSl casein signal sequence are excised from this 
vector by Clal-Sall double digestion. The Clal site is 

15 located 5 bp 5' to the translation initiation codon. 

An 5.3 kb Clal-Sall hLZ fragment was isolated from 
plasmid pKhLZ0.7 and subcloned into a Clal-Sall cut 
16,8hLF3 vector from which the hLF cDNA had been removed 
by Clal-Sall double digestion. 

20 The 16,8hLZ expression cassette vector sequence was 



removed by NotI digestion, subsequently purified 
according to standard procedures and microinjected into 
mouse zygotes . 

Expression data ; 
25 Construct 16 . 8hLZ : 

Seven transgenic mice were generated for construct 
16,8hLZ. Expression data are available for 6 

independent mouse lines (data from lactating offspring, 
using our standard hLZ assay on milk samples) . 
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Construct 


Line 


Max. hLF expression 
in milk (ma/ml) 








16,8 hLZ 


645 


10 




647 


0,7 




661 


260 




662 


7.4 




1069 


60 




1070 


28 



The above data illustrates that 16,8hLZ expresses at 
relatively high levels. In human milk, hLZ levels are 
5 only 50 jig/ml (max) . Since hLZ is a 15 kD protein, a 
level of 0.26 mg/ml hLZ compares to ~ 1.3 mg/ml of hLF 
(hLF is 80 kD) . 

Construct 16,8hLZ3: 

Four independent transgenic mice were generated for 
10 covalent 16,8hLZ3. The following expression data are 
available from mouse lines 905 and 907. 



Mouse line 

905 

907 



15 The data show that 16,8hLZ3 can be expressed at 
relatively high levels (0.36 mg/ml compares to - 1.8 
mg/ml hLF) . However, as also shown, 16,8hLZ3 does not 
always express at high levels. Although the number of 
mice analyzed is very low, constructs 16,8hLZ and 

20 16,8hLZ3 seem to behave more or less similar with regard 
to frequency of expression and expression levels. It 
should be noted, however, that another 7 lines of mice 
transgenic for 16,8hLZ also contain the 16,8hLF3 
construct. (See below.) None of these lines expressed 




Expression (/xg/ml) (max) 
475 
10 
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as high as 0.3 6 mg/ml. Therefore, 16,8hLZ3 appears to 
be a more efficient construct then 16,8hLZ. This could 
be caused by the heterologous splice site (which does 
enhance hLF cDNA expression levels) . 



5 EXAMPLE 2 3 

Transgenic Mice Containing Transgenes 
Encoding Genomic hLZ and hLF cDNA 

Coiniection of 16,8hLF3 and 16.8hLZ 

To assess the feasibility of simultaneously expressing 
10 hLF and hLZ in the milk of transgenic animals, the 
appropriate isolated and purified 16,8hLF3 and 16,8hLZ 
constructs were coinjected into murine zygotes. 

Seven independent mouse lines transgenic for both 
15 constructs were generated. The expression data 
available for each line are as follows: 

hLF expression 
fug/ml) 

500-2000 (max: 2100) 

1-9 

1-4 . 3 

1-15 

1 

0.1 

300-1260 



Conclusions : 

Only line 649 (1/7) expresses hLZ at relatively high 
levels. Line 64 9 and mouse line 660 (2/7) express high 
levels of hLF. 

3 0 Comparison to data obtained 

from single construct injections : 

For 16.8hLZ : 



20 




25 



Mouse 
line 


hLZ expression 
(fiq/ml) 


649 


150-250 (max: 


650 


10-30 


651 


1-2.5 


657 


1-6 


658 


0.5 


659 


<0. 1 


660 


5-25 
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The hLZ expression level of mouse line 64 9, coinjected 
with the 16,8hLF3 and 16,8hLZ expression cassettes is 
comparable to that of line 661 injected only with 
16, 8hLZ. 

5 In most cases, high level expression of hLZ is not 
obtained upon coinjection (1/7: high expression (line 
649) ; 2/7 (650 and 660) : intermediate-low expression; 
4/7: low expression). Upon injection of single hLZ 
transgene, similar data are obtained (1/4: high 
10 expression (line 661); 1/4: intermediate; 2/4: low 
expression). Therefore, behavior of- the l6,8hLZ 
transgene is not measurably influenced by the presence 
of the 16,8hLF3 transgene. 

Note that none of the 7 lines expressed as high as line 
15 905 (construct 16,8hLZ3), although the level of 649 is 
in the same range. 

In conclusion, these constructs can be expressed at 
relatively high levels (0.2-0.5 mg/ml range) with 
approximately 2 0-25 % of the resulting transgenic mice 
20 expressing at these high levels (3/13; 7 coinjections 
+ 6 single in j . ) . Also coinjection with 16,8hLF3 does 
not appear to influence expression of 16,8hLZ. 

For 16.8hLF3 : 

The single injection of 16,8hLF3 resulted in 13 
25 independent transgenic mouse lines which can be divided 
into 2 groups : 

(1) the low expressors which produced levels are 
from 0.1 to 5 ng/vxl (8/13) and 

(2) the high expressors which produced levels from 
30 40 to 200 /xg/ml (5/13). 

Of the mice having the coinjected fragments, 2/7 express 
at high levels. This is similar to the frequency of 
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high level expression observed upon injection of one 
fragment (5/13). However, both 16 , 8hLF3/16, 8hLZ mice 
lines (649 and 660) express hLF at much higher levels 
than observed previously. This indicates that the 
5 presence of the hLZ construct stimulates expression of 
the 16,8hLF3 construct. In line 649, the high hLF 
levels are accompanied by high hLZ levels. For line 
660, this is less clear as hLZ levels are intermediate. 
However, as illustrated below, RNA analysis reveals that 
10 the 16,8hLZ transgene in line 660 is transcriptionally 
at least as active as the hLF transgene. 

Results from expression 
analysis at the mRNA level : 

Northern blot analysis was performed on both bovine 
15 lactating mammary gland total RNA and mammary gland 
total RNA from lactating transgenic mice (including mice 
transgenic for genomic hLF, 16,8hLF3 and 16,8hLF3 + 
16,8hLZ). A 24 bp synthetic oligomer which hybridizes 
to exactly the same sequence in the 5* UTR of bovine crSl 

2 0 casein RNA and in all transgene derived RNA was used as 

a probe. Expression levels were compared directly by 
quantification of the amount of labelled probe 
hybridized to the transgene- and bovine aSl RNA. 

It appeared that the ratio of hLZ- to hLF-mRNA and of 
25 hLZ- to bovine aSl mRNA was much higher than expected 
from the hLZ- and hLF protein levels. For example, line 
649 expressed -0.2 mg/ml of hLZ and -1-2 mg/ml of hLF. 
After correcting for protein size (factor 5) , hLZ and 
hLF mRNA levels are expected to be within the same 

3 0 range, with hLF levels about 2 -fold higher than hLZ RNA 

levels. However, in line 649 hLZ mRNA levels were 2 0- 
fold higher than the hLF mRNA levels. Comparative RNA 
analysis of lines 650, 661, 662 and bovine mammary gland 
RNA confirmed these data. 
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It can therefore be concluded that transcriptionally, 
very high levels of hLZ expression are obtained using 
genomic hLZ sequences and the bovine aSl casein gene 
based expression system of the invention. The genomic 
5 hLZ constructs are transcribed at much higher levels 
than the hLF cDNA constructs, and expressed in the same 
range as the genomic hLF transgenes. 

To compare the performance of different hLF and hLZ 
transgenes at the translational level, a 20-fold 

10 correction should be made. The transcriptional activity 
of hLZ transgenes expressing at 0.25 mg/ml is comparable 
to a protein level of 5 mg/ml, a level of 50 ng/ml 
equivalent to 1 mg/ml. In addition, mouse line 649 hLZ 
mRNA levels exceeded bovine ceSl mRNA levels - which had 

15 been 10-fold diluted - several fold. Since bovine oeS - 
casein is expressed at - 12 mg/ml (and is of similar 
size as hLZ) , these hLZ RNA levels would be equivalent 
to an expression level of several mg/ml. 

EXAMPLE 2 4 

2 0 Generation of 16,8 A hLZ: 

Construct 16,8 A hLZ3 is a derivative of 16,8 
hLZ 3. In 16 .8 A hLZ 3 the hLZ 5 1 UTR sequences and the 
hLZ signal sequence have been replaced with the 
corresponding sequences from the bovine aSl-casein gene. 
25 Construct 16, A hLZ3 is a derivative of 16,8 

A hLZ3. In 16. A hLZ3 the bovine ocSl-casein gene 3 1 UTR 
and flanking sequences have been replaced with the hLZ3 1 
UTR and 4.5 kb of hLZ3" flanking sequences. 

Construction details: 

3 0 Vector p07 . 8hLZ (Fig. 2 3B) was digested with 

Clal and Sail. The 4.7 kb fragment (comprising 0.7 kb 
of the aSl-casein 5 • flanking sequences and the plasmid 
vector) was isolated and ligated to linker GP 278/279 
(Fig. 25) . This DNA sequence comprises part of the 
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bovine aSl-casein 5' UTR, the complete bovine aSl-casein 
signal sequence and 25 bp of hLZ sequence, encoding the 
N-terminal region of adult hLZ. The ligation product 
was isolated and ligated to a 5.3 kb Ball-Sail fragment 
5 from pKHLys3'5.3 (which is depicted in Fig, 23A) . The 
resulting constructs is pO^AhLZAS 1 . From this 
construct the 5.3 kb Clal-Sall fragment was isolated and 
inserted into a Clal-Sall vector, derived from pl6,8hLF3 
(also used in construction of 16,8 hLZ3) . The resulting 

10 construct is designated pl6,8A hLZ3 (Fig. 26). 

For construction of 16, A hLZ3 the vector 
p0.7AhLZA3 l was digested with Xbal and Sail, and a Xbal- 
Notl-Sall linker was inserted (Fig. 27) . This vector 
was linearized with Xbal, and the 6.5 kb Xbal fragment 

15 from XHLYS1 (described by Peters et al. , Eur. J. 
Biochem. 182, 507-516, , 1989) was inserted in the sense 
orientation. This resulted in vector p0.7AhLZ. From 
this vector, the 9.8 kb Clal-Notl hLZ fragment was 
isolated and, together with the 14.5 kb Notl-Clal 

20 fragment from pl6.8hLZ3, inserted into the Notl digested 
pWE15 cosmid vector. 

In both cases, the transgene without plasmid 
sequences was isolated as a Notl fragment (16, 8A hLZ3 ; 
27.8 kb; 16,AhLZ3; 24.3 kb) , purified and microinjected 

25 into fertilized mouse oocytes following standard 
procedures . 

Four independent transgenic founder mice have 
been generated with construct 16,8 A hLZ3 and 6 mice 
were generated with construct 16 A hLZ3 . 
30 Preliminary expression data: 

Mouse line max. hLZ expression 

(/xg/ml in milk) 

16,8 A hLZ3: 1711 56 

1783 20 

35 16 A hLZ3: 1806 '267 

1809 2400 





-117- 

From these results it can be concluded that 
construct 16 A hLZ3 yields much higher expression levels 
than any other hLZ construct tested. 

Preliminary quantitative Northern blotting 
5 data combined with data on hLZ protein expression levels 
indicate that the discrepancy between RNA and protein 
levels as observed for constructs 16,8hLZ and 16,8hLZ3 
does not occur with construct 16 A hLZ3. 

EXAMPLE 25 

10 Transmission experiments with "Calf #4 

Three heifers were super-ovulated using normal 
procedures used in cattle breeding (described in 
Diekman, S.J. et al. (1989) Theriogenology 31 : 473-487) . 
These animals were subsequently inseminated with sperm 

15 from "calf #4" as described in Example 15. Calf #4 was 
judged to be transgenic as described in Example 15. The 
insemination resulted in two pregnancies. 

These two animals were slaughtered four weeks after 
insemination and the embryos recovered from the uterus. 

2 0 Total DNA was isolated from these embryos following 
procedures as described in Maniatis et al. (1982), 
digested with EcoRI, and analyzed by "Southern , Blot 
technique". The blot was hybridized to a probe specific 
for the hLF gene (same protocol as in example 15) . Of 

25 the 12 embryos recovered 6 (50%) showed an hLF-specif ic 
band. In all cases the band was of the expected size 
and intensity. This indicates that: 

(a) the transgene transmits with an efficiency of 
appr. 50% 

30 (b) the copy number is the same as in the founder 

(-3) 

(c) no gross rearrangements have occurred during 
transmission 
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Of the six transgenic embryos, five were male and 
one was female according to a PCR-analysis with primers 
specific for a bovine Y- chromosome repeat. These data 
demonstrate that the transgene can be transmitted to 
both males and females and has not integrated in the Y- 
chromosome. 

The sequences of the Y-chromosome specific 
primers are:>^3 TTV/fe& X^^^tT) 
Forward primer: 5 ■ -GGA TCC GAG ACA CAG AAC AGG-3 1 
10 Reverse primer: S'-GCT AAT CCA TCC ATC CTA TAG- 3 1 



EXAMPLE 2 6 

Expression of recombinant proteins in saliva of calves 

Ten animals were born from oocytes co-injected with 
the hLF transgene (as described in Example 18) and a hLZ 
15 transgene (16,8hLZ; described in Example 22). None of 
these animals appeared transgenic as judged by Southern 
Blot, but four of them (all males) were judged to be 
mosaic based on PCR with 0.5 /xg DNA from blood and ear. 
Primers for this PCR-experiment were located in exon 8_ /szdLn 
2 0 of the hLF gene. The sequences of the primers 4r^^^^^ ' 
-5'-TTT GGA AAG GAC AAG TCA CCG-3 ' and 
5'-CTC ACT TTT CCT CAA GTT CTG-3 ' 



All ten animals were tested for hLF and hLZ 
expression in saliva. Epithelial cells in the salivary 

25 gland are structurally and functionally similar to such 
cells in the mammary gland, and some milk protein genes 
may also be expressed in salivary gland (albeit at much 
lower levels than in mammary gland) . 

Approximately 2 ml of saliva was collected from 

3 0 the mouth of the animal and levels of protein were 
determined in these samples using a radioimmunoassay as 
described in Example 5. Of the ten animals, three showed 
expression of hLF above the lower limit of detection. 



$ i 
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All three animals were part of the group of four animals 
judged to be mosaic* 

sample 2 (ng/ml) 
18 

1.4 
nd 



All 10 animals were also tested for hLZ expression. 
10 Only animal 9772 showed expression of hLZ in saliva. 

The amount detected was 2 ng/ml. 

Of the 21 animals born in the experiment described 

in Example 15, one animal (male) was judged to be mosaic 

based on the fact that it was immunotolerant for hLF. 
15 This animal showed an hLF expression in saliva of 100 

hg/ml. 

These data show that the transgenes used are 
capable of expressing hLF (and hLZ) in bovines. 

Having described the preferred embodiments of the 
20 present invention, it will appear to those ordinarily 
skilled in the art that various modifications may be 
made to the disclosed embodiments, and that such 
modifications are intended to be within the scope of the 
present invention . 

25 All references cited herein are expressly incorporated 
in their entirety by reference for all purposes. 




Expression levels were as follows: 
animal sample 1 (ng/ml) 

5 9772 25 

9773 3 

9774 1.2 



nd = not determined 



(7^ 



